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International filing date (day/month/year) 

09/05/2000 


(Earliest) Priority Date (day/month/year) 
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This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 1 8. A copy is being transmitted to the International Bureau. 



This International Search Report consists of a total of 



sheets. 



PH It is also accompanied by a copy of each prior art document cited in this report. 



1. Basis of the report 

a. With regard to the language, the international search was carried out on the basis of the international application in the 
language in which it was filed, unless otherwise indicated under this item. 

I I the international search was carried out on the basis of a translation of the international application furnished to this 
Authority (Rule 23.1 (b)). 

b. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search 
was carried out on the basis of the sequence listing : 

~| contained in the international application in written form. 

I filed together with the international application in computer readable form. 

furnished subsequently to this Authority in written form. 

furnished subsequently to this Authority in computer readble form. 

f^l the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

PH the statement that the information recorded in computer readable form is identical to the written sequence listing has been 
furnished 



2. 
3. 



PH Certain claims were found unsearchable (See Box I). 
I | Unity of Invention Is lacking (see Box II). 



4. With regard to the title, 

|X| the text is approved as submitted by the applicant. 

I | the text has been established by this Authority to read as follows: 



5. With regard to the abstract, 

T] the text is approved as submitted by the applicant. 

| the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may, 
within one month from the date of mailing of this international search report, submit comments to this Authority. 

6. The figure of the drawings to be published with the abstract is Figure No. 

□ as suggested by the applicant. Q None of the figures. 

I I because the applicant failed to suggest a figure. 

I I because this figure better characterizes the invention. 
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PAMELA S (US); SINHA SUKANT0 (US); TAN 
HUA) 19 December 1996 (1996-12-19) 
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"A" document defining the general state of the art which is not 
considered to be of particular relevance 

■E' earlier document but published on or after the international 
tiling date 

" L " d 2i um . ent ^""5^ may dou bts on priority claim(s)or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use. exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed y 

Date of the actual completion of the International search 

4 October 2000 

Name and mailing address of the ISA 

S rop 252fSt?2, C),flce ' Pa 5818 PatentJaan 2 
NL - 2280 HV Rijswijk 

Tel. (+31-70) 340-2040. Tx. 31 651 epo nl, 

Fax: (431-70) 340-3016 

Form PCT/ISAB10 (second sheet) (July 1992) 




|X I Patont fen^y members are listed in annex. 

T later document published after the international filing date 
or priority date and not in conflict with the application but 
P^j? understand the principle or theory underlying the 

•X" dcxxjment of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
m ^ a |juch combination being obvious to a person skilled 

document member of the same patent family 
Date of mailing of the international search report 

17/10/2000 

Authorized officer 
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Date of mailing 

(day/montWyear) 27.07.200 1 


Applicants or agent's file reference 
X- 11600 


IMPORTANT NOTIFICATION 


International application No. 
PCT/USOO/06707 


International filing date (day/month/year) 
09/05/2000 


Priority date (day/month/year) 
11/05/1999 


Applicant 

ELI LILLY AND COMPANY et al. 



1. The applicant is hereby notified that this International Preliminary Examining Authority transmits herewith the 
international preliminary examination report and its annexes, if any, established on the international application. 

2. A copy of the report and its annexes, if any, is being transmitted to the International Bureau for communication 
to all the elected Offices. 

3. Where required by any of the elected Offices, the International Bureau will prepare an English translation of the 
report (but not of any annexes) and will transmit such translation to those Offices. 



4. REMINDER 

The applicant must enter the national phase before each elected Office by performing certain acts (filing 
translations and paying national fees) within 30 months from the priority date (or later in some Offices) (Article 
39(1)) (see also the reminder sent by the International Bureau with Form PCT/IB/301). 

Where a translation of the international application must be furnished to an elected Office, that translation must 
contain a translation of any annexes to the international preliminary examination report. It is the applicant's 
responsibility to prepare and furnish such translation directly to each elected Office concerned. 

For further details on the applicable time limits and requirements of the elected Offices, see Volume II of the 
PCT Applicant's Guide. 
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D-80298 Munich 

Tel. +49 89 2399 - 0 Tx- 523656 epmu d 
Fax: +49 89 2399 - 4465 
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Buchler, S 
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Applicant's or agent's file reference 
X- 11600 



International application No. 
PCT/US00/06707 
International Patent Classification (IPC) or national classification and IPC 
C07K14/47 



See Notification of Transmittal of International 
FOR FURTHER ACTION Preliminary Examination Report (Form PCT/1PEA/416) 



International filing date (day/month/year) 
09/05/2000 



Priority date (day/month/year) 
11/05/1999 



Applicant 

ELI LILLY AND COMPANY et al. 



1 . This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 



2. This REPORT consists of a total of 6 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the descriptor ^claims ^^^^^ 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 



I 




II 


□ 


III 




IV 


□ 


V 


Si 


VI 


□ 


VII 


□ 


VIII 





citations and explanations suporting such statement 




Date of submission of the demand 



13/11/2000 



Name and mailing address of the international 
preliminary examining authority: 

^- European Patent Office 

Xjft D-80298 Munich 

Tel. +49 89 2399 - 0 Tx: 523656 epmu d 

Fax: +49 89 2399 - 4465 



Date of completion of this report 



27.07.2001 
Authorized officer 

Loubradou, G 

Telephone No. +49 89 2399 8543 
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INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 



International application No. PCT/US00/06707 



I. Basis of the report 

1 . With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed" 
and are not annexed to this report since they do not contain amendments (Rules 70. 16 and 70.17)): 
Description, pages: 

1 -72 as originally filed 

Claims, No.: 

1 -22 as originally filed 

Sequence listing part of the description, pages: 

1-11, filed with the letter of 07.08.2000 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 
K furnished subsequently to this Authority in written form. 

furnished subsequently to this Authority in computer readable form. 

E3 The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

E3 The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 
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6. Additional observations, if necessary: 

Inventive step and Industrial applicability 
,„. «— «- .Pinion wHh ^ „ involve „ ,„ven«.e step <to oe non- 

□ the entire international application. 
3 claims Nos. 21 , 22. 
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so inadequately supported by the description that no meaningful opinion 
D the Cairns, or said claims Nos. are so madequately 
could be formed. 

. nnrt has been established for the said claims Nos. 21 , 22. 
B no international search report has been 
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Instructions: 

-in, inventive step or industrial applicabilrty; 
«.2l with regard to novelty, invenuve t. y 

1. statement 

Yes- Claims 2,3,5-16,18-20 
Novelty (N) m c|aims 1,4,17 

■ ,„ D MS> Yes- Claims 2.3,5-16,18-20 

Inventive step (lb) 



PCT/IPEA/409 (Boxes l-VHI. Sheet 2) (July 
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International application No. PCT/US00/06707 



No: Claims 1,4,17 

Industrial applicability (IA) Yes: Claims 1-20 

No: Claims 



2. Citations and explanations 
see separate sheet 



VIII. Certain observations on the international application 



see separate sheet 



Form PCT/IPEA/'409 (Boxes l-VIII, Sheet 3) (July 1998) 
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Reference is made to the following documents: 

D1 : WO 96 40885 A (MCCONLOGUE LISA C ;KEIM PAMELA S (US); 
SINHA SUKANTO (US); TAN HUA) 19 December 1996 (1996-12-19) 

D2: DATABASE GENEMBL [Online] 6 June 1996 (1996-06-06) HILLIER 
ET AL.: 'yx480g8.r1 soares melanocyte 2NbHM Homo sapiens cDNA 
clone IMAGE:265022 5* similar to contains Alu repetitive element' 
XP002 148339 



Re Item V 

Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 



1. D1 discloses the purification of a p-secretase protein. The starting material of D1 
is identical to the starting material the present application, the p-secretase activity 
assay are similar and several purification steps are shared with the purification of 
the present application (see D1 pages 28 to 30 paragraph entitled "Purification 
and characterisation of p-secretase"). The partial sequences obtained from the 
protein of D1 seem not to be present in SEQ ID NO:2 or SEQ ID NO:4 of the 
present application therefore the proteins having said SEQ IDs appear to be novel 
with regard to the protein of D1 . However the present claim 1 is drafted so broadly 
that its scope encompasses any protein being a "functional equivalent" of the 
proteins having said SEQ IDs without any limitation in sequence requirement (see 
also paragraph 5.). Therefore D1 is prejudicial for the novelty of claim 1. 

2. The attention of the applicant is drawn to the fact that the SEQ ID NO:2 comprises 
a stretch of 27 undetermined amino acids. This implies that the statement "a 
fragment of at least 6 continuous amino acids thereof" in claim 1 encompasses all 
the fragments of the prior art having between 6 and 27 amino acids (Article 33.2 
PCT). 

In addition, the fragments "of at least 6 continuous amino acids" of claim 1 do not 
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INTERNATIONAL PRELIMINARY International application No. PCT/USOO/06707 
EXAMINATION REPORT - SEPARATE SHEET 

appear to solve any technical problem. Consequently said fragments are merely 
considered as an arbitrary selection among all possible fragments of at least 6 
amino acids. Therefore even in absence of undetermined amino acids in the SEQ 
ID NO: 2, claim 1 would be considered as lacking an inventive step (Article 33.3 
PCT). 

3. D2 discloses a sequence which is 97% identical to the sequence of SEQ ID NO:3 
in 485 nucleotide overlap. A fragment having said sequence of D2 inherently 
selectively hybridises with the sequence of SEQ ID NO:3. D2 is therefore 
prejudicial for the novelty of claim 4 f). 

4. D1 (page 41 line 16 to page 44 line 17) describes a p-secretase inhibitor assay 
utilizing purified p-secretase which anticipates claim 17 of the present application 
(Article 33.2 PCT). 

Re Item VIII 

Certain observations on the international application 

5. The terms "functional equivalent" used in claim 1 are vague and unclear and leave 
the reader in doubt as to the meaning of the technical features to which they refer, 
thereby rendering the definition of the subject-matter of said claim unclear (Article 
6 PCT). 

6. Claim 1 9 does not meet the criteria of Article 6 PCT. The present application does 
not provide a p-secretase different from the protein having the sequences shown 
in SEQ IDs No: 2 and 4. Therefore claim 19 is not supported by the description 
over its entire scope. 

7. The same enzyme should not be given different names in the claims (Article 6 
PCT). In claims 1 , 4 and 1 1 the enzyme of the present application is named "an 
amyloid precursor protein protease" and in claims 17 to 20 said enzyme is 
referred to as a " p-secretase". 
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jCT3 RSTd PCT/PTO 3 0 OCT 200 

IN THE UNITED STATES RECEIVING OFFICE (USRO) 

Applicant(s): Gerald Wayne Becker 
International Application No.: PCT/US00/06707 
Filed: 05/09/2000 (05.09.00) 

Invention: AMYLOID PRECURSOR PROTEIN PROTEASE AND RELATED NUCLEIC ACID 

COMPOUNDS 

Lilly Reference: X-1 1 600 

Earliest Priority Date: 05/1 1/1999 (05.1 1 .1999) 



Certificate Under 37 C.F.R. § 1.10 

Attention: DO/EO 
Box PCT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir/Madam: 

"Express Mail" mailing labeLpumber: EL559725839US 
Date of Deposit: 

I hereby certify that the following attached paper or fee 

Transmittal Letter to the United States Designated/Elected Office (US) concerning a filing under 
35 U.S.C. 371 of the International Application identified above is being deposited with the United 
States Postal Service "Express Mail Post Office to Addressee" service under 37 C.F.R. 1.10 on 
the date indicated above and is addressed to the Assistant Commissioner for Patents, 
Washington, D.C. 20231. 

(Typed or printed name of person mailing paper) 
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Applicant's or agent's file reference 
X- 11600 


See Notification of Transmittal of International 
FOR FURTHER ACTION Preliminary Examination Report (Form PCT/IPEA/416) 


International application No. 
PCT/US00/06707 


International filing date (day/month/year) 
09/05/2000 


Priority date (day/month/year) 
11/05/1999 


International Patent Classification (IPC) or national classification and IPC 
C07K 14/47 


Applicant 

ELI LILLY AND COMPANY et al. 



1. This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 6 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 



I 




II 


□ 


III 




IV 


□ 


V 


El 


VI 


□ 


VII 


□ 


VIII 





citations and explanations suporting such statement 



Date of submission of the demand 
13/11/2000 


Date of completion of this report 
27.07.2001 


Name and mailing address of the international 
preliminary examining authority: 

^ European Patent Office 
0})) D-80298 Munich 

C^ 7 Tel. +49 89 2399 - 0 Tx: 523656 epmu d 
Fax: +49 89 2399 - 4465 


Authorized officer 

Loubradou, G (I l) 

Telephone No. +49 89 2399 8543 
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INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 



International application No. PCT/USOO/06707 



I. Basis of the report 

1 . With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed" 
and are not annexed to this report since they do not contain amendments (Rules 70. 16 and 70. 1 7)): 
Description, pages: 

1-72 as originally filed 

Claims, No.: 

1 -22 as originally filed 

Sequence listing part of the description, pages: 

1-11, filed with the letter of 07.08.2000 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form, 
furnished subsequently to this Authority in written form. 

K furnished subsequently to this Authority in computer readable form. 

K The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

H The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 
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5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1 . The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non- 
obvious), or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 
IS claims Nos. 21, 22. 

because: 

□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 

□ the description, claims or drawings {indicate particular elements beloW) or said claims Nos. are so unclear 
that no meaningful opinion could be formed (specify): 

□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion 
could be formed. 

no international search report has been established for the said claims Nos. 21, 22. 

2. A meaningful international preliminary examination cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement 

Novelty (N) Yes: Claims 2, 3, 5-16, 18-20 

No: Claims 1,4,17 

Inventive step (IS) Yes: Claims 2, 3, 5-16, 18-20 
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No: 



Claims 1, 4, 17 



Industrial applicability (IA) Yes: 

No: 



Claims 1-20 
Claims 



2. Citations and explanations 
see separate sheet 



VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
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see separate sheet 
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Reference is made to the following documents: 

D1 : WO 96 40885 A (MCCONLOGUE LISA C ;KEIM PAMELA S (US); 
SINHA SUKANTO (US); TAN HUA) 19 December 1996 (1996-12-19) 

D2: DATABASE GENEMBL [Online] 6 June 1996 (1996-06-06) HILLIER 
ET AL.: 'yx480g8.r1 soares melanocyte 2NbHM Homo sapiens cDNA 
clone IMAGE:265022 5' similar to contains Alu repetitive element' 
XP002 148339 



Re Item V 

Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 

1 . D1 discloses the purification of a (5-secretase protein. The starting material of D1 
is identical to the starting material the present application, the 0-secretase activity 
assay are similar and several purification steps are shared with the purification of 
the present application (see D1 pages 28 to 30 paragraph entitled "Purification 
and characterisation of p-secretase"). The partial sequences obtained from the 
protein of D1 seem not to be present in SEQ ID NO:2 or SEQ ID NO:4 of the 
present application therefore the proteins having said SEQ IDs appear to be novel 
with regard to the protein of D1 . However the present claim 1 is drafted so broadly 
that its scope encompasses any protein being a "functional equivalent" of the 
proteins having said SEQ IDs without any limitation in sequence requirement (see 
also paragraph 5.). Therefore D1 is prejudicial for the novelty of claim 1 . 

2. The attention of the applicant is drawn to the fact that the SEQ ID NO:2 comprises 
a stretch of 27 undetermined amino acids. This implies that the statement "a 
fragment of at least 6 continuous amino acids thereof" in claim 1 encompasses all 
the fragments of the prior art having between 6 and 27 amino acids (Article 33.2 
PCT). 

In addition, the fragments "of at least 6 continuous amino acids" of claim 1 do not 
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appear to solve any technical problem. Consequently said fragments are merely 
considered as an arbitrary selection among all possible fragments of at least 6 
amino acids. Therefore even in absence of undetermined amino acids in the SEQ 
ID NO: 2, claim 1 would be considered as lacking an inventive step (Article 33.3 
PCT). 

3. D2 discloses a sequence which is 97% identical to the sequence of SEQ ID NO:3 
in 485 nucleotide overlap. A fragment having said sequence of D2 inherently 
selectively hybridises with the sequence of SEQ ID NO:3. D2 is therefore 
prejudicial for the novelty of claim 4 f). 

4. D1 (page 41 line 16 to page 44 line 17) describes a p-secretase inhibitor assay 
utilizing purified p-secretase which anticipates claim 17 of the present application 
(Article 33.2 PCT). 

Re Item VIII 

Certain observations on the international application 

5. The terms "functional equivalent" used in claim 1 are vague and unclear and leave 
the reader in doubt as to the meaning of the technical features to which they refer, 
thereby rendering the definition of the subject-matter of said claim unclear (Article 
6 PCT). 

6. Claim 1 9 does not meet the criteria of Article 6 PCT. The present application does 
not provide a p-secretase different from the protein having the sequences shown 
in SEQ IDs No: 2 and 4. Therefore claim 19 is not supported by the description 
over its entire scope. 

7. The same enzyme should not be given different names in the claims (Article 6 
PCT). In claims 1 , 4 and 1 1 the enzyme of the present application is named "an 
amyloid precursor protein protease" and in claims 17 to 20 said enzyme is 
referred to as a "p-secretase". 



Form PCT/Separate Sheet/409 (Sheet 2) (EPO-April 1997) 




4 



J 



{ 



WORrolNTELLECTUAL PROPERTY ORGANIZATION 

International Bureau 



PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 

C07K 14/47, C12N 15/57, 9/64, 5/10, 
5/16, A61P 25/2$ 



Al 



(11) International Publication Number: 



WO 00/68266 



(43) International Publication Date: 16 November 2000 (16.1 1.00) 



(21) International Application Number: 

(22) International Filing Date:' 



PCT/USOO/06707 



9 May 2000 (09.05.00) 



(30) Priority Data: 

60/133,423 



11 May 1999 (11.05.99) 



US 



(71) Applicant (for all designated States except US): ELI LILLY 

AND COMPANY [US/US]; Lilly Corporate Center, Indi- 
anapolis, IN 46285 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only)'. BECKER, Gerald, Wayne 
[US/US]; 10815 East 121st Street, Fishers, IN 46038 
(US). HALE, John, Edward [US/US]; 7644 Forest Drive, 
Fishers, IN 46038 (US). HEATH, William, Francis, Jr. 
[US/US]; 11214 Tuffton Street, Fishers, IN 46038 (US). 
JOHNSTONE, Edward, Marion [US/US]; 5129 East 69th 
Street, Indianapolis, IN 46220 (US). LITTLE, Sheila, Parks 
[US/US]; 4480 North Meridian Street, Indianapolis, IN 
46208 (US). TU, Yuan [US/US]; 10848 Madeline Court, 
Fishers, IN 46038 (US). YEH, Wu-Kuang [US/US]; 720 
Primrose Court, Greenwood, IN 46142 (US). YIN, Tinggui 
[CN/US]; 409 Mari Way, Carmel, IN 46032 (US). 



(74) Agents: WILSON, Alexander et al.; Eli Lilly and Company, 
Lilly Corporate Center, Indianapolis, IN 46285 (US). 



(81) Designated States: AE, AG, AL, AM, AT, AU, AZ, BA, BB, 
BG, BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, 
DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, 
IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, 
LV, MA, MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, 
RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, 
UA, UG, US, UZ, VN, YU, ZA, ZW, ARIPO patent (GH, 
GM, KE, LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, 
IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the lime limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: AMYLOID PRECURSOR PROTEIN PROTEASE AND RELATED NUCLEIC ACID COMPOUNDS 
(57) Abstract 



This present invention provides an amyloid precursor protein protease and related nucleic acid compounds thereof. The invention 
also provides compositions, expression vectors, and transfected host cells as well as assays and methods of use. The compounds of the 
present invention will further characterize Alzheimer's Disease and other neurodegenerative disease states. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front 



AL 


Albania 


ES 


Spain 


AM 


Armenia 


FI 


Finland 


AT 


Austria 


FR 


France 


AU 


Australia 


GA 


Gabon 


AZ 


Azerbaijan 


GB 


United Kingdom 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


BB 


Barbados 


GH 


Ghana 


BE 


Belgium 


GN 


Guinea 


BF 


Burkina Faso 


GR 


Greece 


BG 


Bulgaria 


HU 


Hungary 


BJ 


Benin 


IE 


Ireland 


BR 


Brazil 


IL 


Israel 


BY 


Belarus 


IS 


Iceland 


CA 


Canada 


IT 


Italy 


CF 


Central African Republic 


JP 


Japan 


CG 


Congo 


KE 


Kenya 


CH 


Switzerland 


KG 


Kyrgyzstan 


CI 


Cote d'l voire 


KP 


Democratic People' 


CM 


Cameroon 




Republic of Korea 


CN 


China 


KR 


Republic of Korea 


CU 


Cuba 


KZ 


Kazakstan 


CZ 


Czech Republic 


LC 


Saint Lucia 


DE 


Germany 


LI 


Liechtenstein 


DK 


Denmark 


LK 


Sri Lanka 


EE 


Estonia 


LR 


Liberia 



of pamphlets publishing international applications under the PCT. 



LS 


Lesotho 


SI 


Slovenia 


LT 


Lithuania 


SK 


Slovakia 


LU 


Luxembourg 


SN 


Senegal 


LV 


Latvia 


SZ 


Swaziland 


MC 


Monaco 


TD 


Chad 


MD 


Republic of Moldova 


TG 


Togo 


MG 


Madagascar 


TJ 


Tajikistan 


MK 


The former Yugoslav 


TM 


Turkmenistan 




Republic of Macedonia 


TR 


Turkey 


ML 


Mali 


TT 


Trinidad and Tobago 


MN 


Mongolia 


UA 


Ukraine 


MR 


Mauritania 


UG 


Uganda 


MW 


Malawi 


US 


United States of America 


MX 


Mexico 


UZ 


Uzbekistan 


NE 


Niger 


VN 


Viet Nam 


NL 


Netherlands 


YU 


Yugoslavia 


NO 


Norway 


zvv 


Zimbabwe 


NZ 


New Zealand 






PL 


Poland 






PT 


Portugal 






RO 


Romania 






RU 


Russian Federation 






SD 


Sudan 






SE 


Sweden 






SG 


Singapore 







INTERNA 



TjAfVL SEARCH REPORT I (B 



HI No 

PCT/US 00/06707 



A. CLASSIFICATION OF SUBJECT HATTER , , , , 

IPC 7 C07K14/47 C12N15/57 C12N9/64 C12N5/10 C12N5/16 
A61P25/28 

According to International Patent Classification (IPC) or to both national classification and IPC 

B, FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N C07K A61P 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 
Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

STRAND , ENBL, GENSEQ, EPO-Internal 



C DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 0 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


DATABASE GENEMBL 'Online! 


1-16 




6 June 1996 (1996-06-06) 






HILLIER ET AL.: "yx480g8.rl soares 






melanocyte 2NbHM Homo sapiens cDNA clone 






IMAGE: 265022 5' similar to contains Alu 






repetitive element" 






XP002148339 






Accesion number N30496 




X 


U0 96 40885 A (MCC0NL0GUE LISA C ;KEIM 


17 




PAMELA S (US); SINHA SUKANT0 (US); TAN 






HUA) 19 December 1996 (1996-12-19) 






page 22, line 20 -page 23, line 2 






page 41, line 20 -page 48, line 34 






claims 23-41; table 3 











Further documents are listed in the continuation of box C 



Patent family members are listed in annex. 



° Special categories of cited documents : 

•A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

*L* document which may throw doubts on priority ciaim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

*0" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



T later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

"&' document member of the same patent family 



Date of the actual completion of the international search 

4 October 2000 


Date of mailing of the international search report 

17/10/2000 


Name and mailing address of the ISA 

European Patent Office, P.B. 561 8 Patentiaan 2 
NL - 2280 HV Rijswifc 
Tel. (431-70) 340-2040. Tx. 31 651 epo nl, 
Fax: (431-70) 340-3016 


Authorized officer 

ALC0NADA R0DRIG. . , A 



Form PCT71SA/210 ( 



aheet) (July 1882) 



page 1 of 2 



INTERNATT^L SEARCH REPORT 



C.(ConMnuaMon) DOCUMENTS CONSIDERED TO BE RELEVANT 



TtematloriSW&ptteatton No 

PCT/US 00/06707 



Category 



relevant passages 



Citation of document, with indfeaaoawhere appropriate, of the" 

EP 0 653 154 A (HOECHST JAPAN) 
17 May 1995 (1995-05-17) 
page 5, line 13-21 
claim 10 



DATABASE SWISSPROT 'Online' 
1 June 1998 (1998-06-01) 

frJJSSftF A1 " : " Bos taurus butyrophilin 

XP002148340 
Accession No. 046535 

"0 ? 9 46281 A (BAKER KEVIN P ;CHEN JIAN 

(US); 6ENENTECH INC (US); GURNEY AUSTIN () 

16 September 1999 (1999-09-16) 

page 12, line 24-35 

page 60, line 10-29 

page 128, line 15-22 

page 149, line 30-33 

page 151, line 22-30 

page 159, line 17-19 



Relevant to claim No. 



19 



1-16 



1-16 



Form PCT/JSA/210 (continuation of second chaet) (Jury 1082) 



page 2 of 2 



International Application No. PCT/US 00 j06707 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



Continuation of Box 1.2 
Claims Nos.: 21,22 



Claims 21 and 22 refer to an antagonist/inhibitor of the polypeptide 
without giving a true technical 

characterization. Moreover, no such compunds are defined in the 
application. In consequence, the 

scope of said claims is ambigous and vague, and their subject-matter is 
not sufficiently disclosed 

and supported (Art. 6 PCT). No search could be carried out for such 
purely speculative claims whose wording is, in fact, a mere recitation of 
the result to be achieved. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 



INTERNA 

Information 



ton o" oatt 



L SEARCH REPORT 

patent family members 




ternatJorimKpplleatton No 

PCT/US 00/06707 



Patent document 




PirKli ration 

t UUll Ul UWf 1 




Patent family 






cited in search report 




date 




member(s) 




date 


U0 9640885 


A 


19-12-1996 


US 


5744346 


A 


28-04-1998 








AU 


6383396 A 


30-12-1996 








EP 


0871720 


A 


21-10-1998 








or 


11507538 


T 


n#w fi7— 1 OQQ 








US 


5942400 


A 


24-08-1999 


EP 0653154 


A 


17-05-1995 


JP 


7132033 


A 


23-05-1995 








CA 


2135595 


A 


13-05-1995 








US 


6037521 


A 


14-03-2000 


W0 9946281 


A 


16-09-1999 


AU 


3072199 


A 


27-09-1999 








AU 


3075099 


A 


11-10-1999 








WO 


9947677 


A 


23-09-1999 








AU 


1532499 


A 


15-06-1999 








EP 


1032667 


A 


06-09-2000 








WO 


9927098 


A 


03-06-1999 








AU 


3757099 


A 


08-11-1999 








WO 


9954467 


A 


28-10-1999 








AU 


1070399 


A 


10-05-1999 








EP 


1025227 


A 


09-08-2000 








W0 


9920756 


A 


29-04-1999 



Form PCT/lSA/210 (patent tamfly annex) (Jtfy 1 992) 





i 



WO 99/46281 





PCT/US99/05028 




GCCCACTTGGCGGTAAGCA^TGCAACTTCTGCiyVCTTCTGCTGGGGCTTTTGGG^SGS 



^CC^^^S?^S;HSEri?5I?T^GTGCTTGCCACAGGGGA^?GGCTCTG 

AGGCGA 
GAGCTC 
CACTTT 

CcS^?r^ CC ^ TCATT ^ 



CTTCT 
CCTCA 
AGGTC 
GCAAG 



c C ?a"S GAGAG JJ CAC ?^ 

TCCTCA 
CAGGTC 

?T A S^ GGG ^ CCAATCC "^ CCGA ^ 



.G TAG C TAT T G A 
GGCCGGCTTTG 
TCTCTGTGTGG 
CTCAGCGATGG 
GCCCATCGAGA 
ACAGCTCCCGG 
GGAGAGCCCCT 
TACGGGGCAGC 
T GGAGAT AG T A 



c?^^^^^SSSE^??^^GTCCCGAGTTGGTCAC?mG?iScTATTGA 

ATGGCCG 
TGTCTCT 
TGCTCAG' 
GTGCCCA 
TCACAGC 
ATGGAGA* 
CAT AC C C* ( 



J^^^^GAGGTCACTGCTCAGAGGTCACTGAACTATGAAGA^TGGCCGGCTTTG 

TCTGTGTG 
CAGCGATG 1 
C CATC GAG 
AGCTCCCG 

? G "^ 



AG ^ GCAGG ^ A TCGCAGAGGACAGCG 

GATG' 
CGAG 

CTACAGCATCCG^AATGG^AATG^G^^^^^^^^^^^^^^^^^^^^^^^^^^^^' 



|||=EiSifs== 



:aaccatcccaggcagaggaatgcc 
:cacaggccagccacgttccaggcc 
3gagaccagggcagtgaggaagccc 

^tcagggccgcgtcagatcctgS 



TGCAAGACACGGTCAACCT^fTTr^fS?^^ CCGGCGm ^ GCCGA ^ GGT CC 



G C C AAG C CAG GAG G CAG CAG GAG T G CAAT C C CAGACACAGAT GG C CC^A^^M^? 



IGGCTGG 



5CIO: <WO_8946281A2_IA> 



WO 99/46281 




PCT/US99/05028 



FIGURE 17QB 

AGGCCAGACAGACCCAGAACAGGAGGAAGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGA 
AGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCTGGACCCCAGCACAGGTCTGGCCCTGGAC 
CGGCTGAGCGCCCCTGACCCGGCCTGGATGGCGAGACTCTCTTTGCCCCTCACCACCAACTA 
CCGTGACAATGTGATCTCCCCGGATGCTGCAGCCACGGAGGAGCCGAGGACCTTCCAGACGT 
TCGGCAAGGCAGAGGCACCAGAGCTGAGCCCAACAGGCACGAGGCTGGCCAGCACCTTTGTC 
TCGGAGATGAGCTCACTGCTGGAGATGCTGCTGGAACAGCGCTCCAGCATGCCCGTGGAGGC 
CGCCTCCGAGGCGCTGCGGCGGCTCTCGGTCTGCGGGAGGACCCTCAGTTTAGACTTGGCCA 
CCAGTGCAGCCTCAGGCATGAAAGTGCAAGGGGACCCAGGTGGAAAGACGGGGACTGAGGGC 
AAGAGCAGAGGCAGCAGCAGCAGCAGCAGGTGCCTGTS^ACATACCTCAGACGCCTCTGGAT 
CCAAGAACCAGGGGCCTGAGGATCTGTGGACAAGAGCTGGTTTCTAAAATCTTGTAACTCAC 
TAGCTAGCGGCGGCCTGAGAACTTTAGGGTGACTGATGCTACCCCCACAGAGGAGGCAAGAG 
CCCCAGGACTAACAGCTGACTGACCAAAGCAGCCCCTTGTAAGCAGCTCTGAGTCTTTTGGA 
GGACAGGGACGGTTTGTGGCTGAGATAAGTGTTTCCTGGCAAAACATATGTGGAGCACAAAG 
GGTCAGTCCTCTGGCAGAACAGATGCCACGGAGTATCACAGGCAGGAAAGGGTGGCCTTCTT 
GGGTAGCAGGAGTCAGGGGGCTGTACCCTGGGGGTGCCAGGAAATGCTCTCTGACCTATCAA 
TAAAGGAAAAGCAGTAAAAAAAAAAAAAAAAAAAA 
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LALEIQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVLDT 



FIGURE 171 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48331 
<subunit 1 of 1, 1184 aa, 1 stop 
<MW: 129022, pi: 5.20, NX (S/T) : 5 

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQA 

GAAFQVLQLPQALPIQVDSEEGLLSTGRRLDREQLCRQWDPCLVSFDVLATGDLALIHVEIQ 

VLDINDHQPRFPKGEQELEISESASLRTRIPLDRALDPDTGPNTLHTYTLSPSEHFALDVIV 

GPDETKHAELIWKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVNVLDSNDNSPAFAESS 

FS I DAKTGQVILRRPL 

DYEKNPAYEVDVQARDLGPNPIPAHCKVLIKVLDVNDNIPSIHVTWASQPSLVSEALPKDSF 

IALVMADDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQD 

QGLQPLSAKKQLSIQISDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVS 

YRIQDSPVAHLVAIDSNTGEVTAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDA 

NDNAPEWQPVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTT 

IVARDADSGANGEPLYSIRNGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIWEDQGS 

PPLQTRALLRVMFVTSVDHLRDSARKPGALSMSMLTVICLAVLLGIFGLILALFMSICRTEK 

KDNRAYNCREAESTYRQQPKRPQKHIQKADIHLVPVLRGQAGEPCEVGQSHKDVDKEAMMEA 

GWDPCLQAPFHLTPTLYRTLRNQGNQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEP 

QPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQ 

ILRSLVRLSVAAFAERNPVEELTVDSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGS 

RSAIPDTDGPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPD 

PAWMARLSLPLTTNYRDNVISPDAAATEEPRTFQTFGKAEAPELSPTGTRLASTFVSEMSSL 

LEMLLEQRSSMPVEAASEALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSS 
SSSRCL 



amino acids 1-13 

Transmembrane domain: 

amino acids 719-739 

N-glycosylation site. 

amino acids 415-418, 582-585, 659-662, 662-665 amd 857-860 

Cadherins extracellular repeated domain signature. 

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563 
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FIGURE 172 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGG 

CAGACCGTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAG 

GCAGGAGCCTTCCTTACACTTCGCC&ISAGTTTCCTCATCGACTCCAGCATCATGATTACCT 

CCCAGATACTATTTTTTGGATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTAT 

GAGATACGTCAGTATGTTGTACAGGTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCAT 

GTTTGAGCTCATCATCTTTGAAATCTTAGGAGTATTGAATAGCAGCTCCCGTTATTTTCACT 

GGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTTTCATGGTGCCTTTTTACATTGGC 

TATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTTCCTGTCTCTT 

ATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCCCATTCTCAGCCCAA 

AACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAGTGACTCTC 

ATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTCCT 

CAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATA 

TGATCATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAA 

GTGCATAACAAACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGG 

AAGT GAAAA.T C T T AC T C T T AT TCAACAGGAAG TGGATGC T T T GGAAGAAT TAAGCAGGCAGC 

TTTTTCTGGAAACAGCTGATCTATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTC 

AAGGGGAAATATTTTAATTTTCTTGGTTACTTTTTCTCTATTTACTGTGTTTGGAAAATTTT 

CATGGCTACCATCAATATTGTTTTTGATCGAGTTGGGAAAACGGATCCTGTCACAAGAGGCA 

TTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGTGAAGTTTTGGTCCCAACACATT 

TCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGATTGCTGATCACTCTTAC 

CAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTGCTATTAGCAC 

AGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTTAGAA 

TACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTT 

TGATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAAC 

AGG C AC C AG AG AAG C AAAT G G C AC C T2S&A.C T T AAG C C T AC T ACAG AC T G T TAG AG G C C AG T 

GGTTTCAAAATTTAGATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAAC 

AAACAAAATGCTATGGTAGCATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATA 

C TAT G AC CAT GAG TAG CAT C AG C C AGAAC AT GAGAGGG AG AAC TAAC T C AAGACAAT AC T CA 

GCAGAGAGCATCCCGTGTGGATATGAGGCTGGTGTAGAGGCGGAGAGGAGCCAAGAAACTAA 

AGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGTCTATGGTAGCTGAGCCAAACACGT 

AGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTGCCTTGAGATTGACTCATT 

AAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 

ACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 173 



MS FLIDSSIMITSQILFFGFGWL FFMRQL FKD YE I RQ YWQVT FS VT FAFS CTMFEL 1 1 FE I 

LGVLNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRXLFSCLLWLTFMYFF 
WKLGDP FP I LS PKHG ILSIEQLIS RVGVI GVTLMALLS G FGAVNCP YT YMS YFLRNVTDTDI 

LALERRLLQTMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQ 

QEVDALEELSRQLFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVF 

DRVGKTDPVTRGIEITVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISS 
SKS SNVI VLLLAQ IMGMY FVS S VLLI RMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVSA 
LS S I L FLYLAHKQAPEKQMAP 

Important features: 



amino acids 1-23 



Potential transmembrane domains: 



amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398 
425-444 ' ~ ' 



N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 



/f^237- 
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FIGURE 174. 

CATGGGAAGTGGAGCCGGAGCCTTCCTTACACTCGCCATGAGTTTCCTCATCGACTCCAGCA 
TCATGATTACCTCCCNGANACTATTTTTTGGATTTGGGTGGCTTTTCTTCNGCGCCAATGTT 
TAAAGACTATGAGATACGTCAGTATGTTGTACNGGTGATCTTCTCCGTGACGTTTGCCATTT 
CTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTNGGAGTATTGAATAGCAGCTCCCGT 
TATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTNTCATGGTGCCTTT 
TTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTT 
CCTGTCTCTTATGGCTGACCTTTATGTATTTCCAG 
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FIGURE 175 

GTGTTGCCCTTGGGGAGGGGAAGGGGAGCCNGGCCCTTTCCTAAAATTTGGCCAAGGGTTTC 

TTTNTTGAATTCCGGGTTNNGNATACCTTCCCAGAAAATATTTTTTGGATTTGGGGTAGNTT 

TTTTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAGGTGATNTT 

NTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATNTTTGAAATNTTAGGAG 

TATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATC 

CTGGTTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCA 

TAAACAACGACTGCTTTTTTCCTGTCTNTTATGGCTGACCTTTATGTATTTNTTNTGGAAAN 
TAGGAGATCCCTTTCCCATTCTC 
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FIGURF 17BA 

r a A ^r^ A * AG T GAG . I G S 1 AGAGAG ? GGACAT C G AC C AG G GAG C T GATATGCAAAAGGAAAG 




CTGgIggIga!^^ 

iGAAAG 



^ GG ^ G ^«AGTGGITGGGAGTCAGTGTTCGGAGCCAGGGGCCTGGGGGCA^T : 



^ C ?^rI GGC J GG S G S CCCCCGCGCCMC ^^ G5GT =^GTGGTCAic^G^^^ 



C ? GCG £ CAG IS55ET GGTGCC C<»GGTTATGCTGTCTGGGGAGCGCCTGaCCTCCGGCTTTG 

CCTGATAGTGGGTGCC 



JTCGCCAA 
:CTGTCAG 

:tgacacc 
ictccacg 



SSJJf^SIS^SSJ^^^^^ACCATGGGAGCAGCCTGGGG^TGTCGCCAA 

CO 



aaS^ GC ? C ^ c ^ gggccagacccatcctccatgtct ^ca;gagg C ^^ 



G A AG C ? G r GGAGACGCCATGTTCCAGCTCCAGG ^ TGTC ^A^ C H C fc1 



g ^ ag J gacc H g ^SI a E a ?I c ! ccag a^cctcggctccggcgacaggctcctggccaggg 

g g g c agag at c c 

cagctggtccac 

ggatgtggatgg 

C CC gS CCA £ GGACCCAGCCCAGC ^^ 



AC?? CC ? CCAG ? GGCCCCCATCCTCAATGCCCflCCAGCC ^CACCCAGCGGGCAGAGATCC 
AAGAAGAG £ GG ^ G ^^ T ^^ACTGAGTGGGCAGC^GTCATTGGCCTGGAGC 



^CTCTTCTTCTCTGGTGTGGTGAGGGRCGAGarra^™^^ 



A^^^^^^SSEEI^T^CCTTACK^CTCCGGGATC^TTGAGACC 

CCAGTCTC 

A^rAA^™'™ TGGTGTGGTGA ^ 

Trr^?nrS^^ ATGAGGTCACGGTTTCCAACCAAG GCCAGTCGCT^ 
^S AAG ^ GAAAAACATCACCCTGGA CTGCGCCCGGGGCACGGCi 



g|AGS^Sc G CC G S C G C S 

^ cccac I c I acagcttt ^cgcgcggctgtgctgcatgt GC ggSgccg?? G c?g 
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FIGURE 176B 



CCCCCGGCGGGAGGGCCCGGATGCACACCCCATCCTGGCTGCTGACGGGCATCCCGAGCTGG 

GCCCCGATGGGCATCCAGGGCCAGGCACCGCCIAfiGTTCCCATGTCCCAGCCTGGCCTGTGG . 

CTGCCCTCCATCCCTTCCCCAGAGATGGCTCCTTGGGATGAAGAGGGTAGAGTGGGCTGCTG 

GTGTCGCATCAAGATTTGGCAGGATCGGCTTCCTCAGGGGCACAGACCTCTCCCACCCACAA 

GAACTCCTCCCACCCAACTTCCCCTTAGAGTGCTGTGAGATGAGAGTGGGTAAATCAGGGAC 

AGGGCCATGGGGTAGGGTGAGAAGGGCAGGGGTGTCCTGATGCAAAGGTGGGGAGAAGGGAT 

CCTAATCCCTTCCTCTCCCATTCACCCTGTGTAACAGGACCCCAAGGACCTGCCTCCCCGGA 

AGTGCCTTAACCTAGAGGGTCGGGGAGGAGGTTGTGTCACTGACTCAGGCTGCTCCTTCTCT 

AGTTTCCCCTCTCATCTGACCTTAGTTTGCTGCCATCAGTCTAGTGGTTTCGTGGTTTCGTC 

TAT T TAT T AAAAAAT AT T T GAGAACAAAAAAAAAAAAAAAAAAAA 
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FIGURE 17? 

></usr/ S eqdb2/s S t/DNA/Dnaseqs.min/ Ss .DNA55737 
xsubunit i of 1, 1141 aa, 1 stop 

><MW: 124671, pi : 5 .82, NX{S/T): 5 

PEVMLSGERLisGFG^VA^ 
SPLRLCGSPDSMFGIS^L^TMn^ 

lsgqpvigl£^tolpsdpaqpSSgSda^^ 

ENASHVECELGNPMKRGAQVT^S?^ 
*ELPLSIAGMAIPQQLFFSGVTOGERAMn«^ 

WPHE I ANGKWLL YPMOVELEG^nr 522? ^c5» GSKVKYEVTV SNQGQSLRTLGSAFLNIM 



amino acids 1-33 



amino acids 1039-1064 
N-glycosylation sites . 

amino acids 86-89, 7,6-749, 949-952, 985-988 and 1005-1008 



^lttf 24 ^l™ : ll%l°\ 2 ^°]l- 317-34*. 443-465, 385- 
and 1031-1047 ' 322 ~ 346 ' 470-479, 442-466, 379-408 
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FIGURE 178 



CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGA 

AAGCAGCGAGTTGGCAGAGCAGGGCTGCATTTCCAGCAGGAGCTGCGAGCACAGTGCTGGCT 

CACAACAAG ATG CTCAAGGTGTCAGCCGTACTGTGTGTGTGTGCAGCCGCTTGGTGCAGTCA 

GTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGGTAATTTTC 

T G GAT G AT AAAC AAT G G C T C AC C AC AAT C T C T CAG TAT G AC AAG GAAG T C G GACAG T G GAAC 

AAATTCCGAGACGAAGTAGAGGATGATTATTTCCGCACTTGGAGTCCAGGAAAACCCTTCGA 

TCAGGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTAT 

GCATTGCTCAAGATTCTCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATG 

AAAGAAGCAGGAGTAGACCATAGGCAGTGGAGGGGTCCCATATTATCCACCTGCAAGCAGTG 

CCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCATACCTACTCTTTTCAGTGCA 

AACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTCAGTCA^ATGTGAAGGACATTGC 

C CAT G T C C T T CAG A T AAG C C C AC CAG T AC AAG CAG AAAT G T T AAGAG AG CAT G CAG T G AC C T 

GGAGTTCAGGGAAGTGGCAAACAGATTGCGGGACTGGTTCAAGGCCCTTCATGAAAGTGGAA 

G T C AAAAC AAG AAG AC AAAAAC AT T G C T GAG G C C T GAG AG AAG CAG AT T C GAT AC CAG CAT C 

TTGCCAATTTGCAAGGACTCACTTGGCTGGATGTTTAACAGACTTGATACAAACTATGACCT 

GCTATTGGACCAGTCAGAGCTCAGAAGCATTTACCTTGATAAGAATGAACAGTGTACCAAGG 

CAT T C T T C AAT T C T T G T G AC AC AT AC AAG GACAG T T T AAT AT C T AAT AAT GAG T G G T G C T AC 

TGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCA 

AGGGGTAAAGAAGCTCCTAGGACAGTATATCCCCCTGTGTGATGAAGATGGTTACTACAAGC 

CAACACAATGTCATGGCAGTGTTGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTC 

ATGGGATCCAGAATAAATGGTGTTGCAGATTGTGCTATAGATTTTGAGATCTCCGGAGATTT 

T G C TAG T G G C GAT T T T C A T G AAT G G AC T GAT GAT GAG GAT GAT GAAG AC GAT AT TAT G AAT G 

AT GAAG AT G AAAT T GAAG AT GAT GAT GAAGAT GAAGG GG AT GAT GAT G AT GG T GG T GAT G AC 

CAT G AT G T AT AC AT T TGA T T GAT GACAG T T G AAAT C AAT AAAT T C T ACAT T T C T AAT AT T T A 

CAAAAATGATAGCCTATTTAAAATTATCTTCTTCCCCAATAACAAAATGATTCTAAACCTCA 

CAT AT AT T T T G TAT AAT TAT T T GAAAAAT T GC AGC TAAAGT TATAGAACT T TAT GT T T AAAT 

AAG AAT CAT TTGCTTTGAGTTTT TAT AT T C C T T AC AC AAAAAGAAAAT AC AT AT GC AG T C T A 

GTCAGACAAAAT AAAG T T T T GAAGTGCTACTATAATAAATTTTTCACGAGAACAAACTTTGT 

A2\ATCTTCCATAAGCAAAATGACAGCTAGTGCTTGGGATCGTACATGTTAATTTTTTGAAAG 

ATAATTCTAAGTGAAATTTAAAATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGG 

AAAAATATGCATGCTTTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTCAG 

GATAACAGAGAGAT ACC AC AT GAC TCCAAAAAAAAAAAAAAA 




BNSDOCID: <WO 9946281 A2_IA> 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA49829 
xsubunit 1 of 1, 436 aa, l stop 

XMW: 49429, pi: 4.80, NX(S/T): 0 

MLKVS AVLCVCAAAWCS QS LAAAAAVAAAGGRS DGGNFLDDKQWLTT I S QYDKEVGQWNKFR 

DEVEDDYFRTWSPGKPFDQALDPAKDPCLKMKCSRHKVCIAQDSQTAVCISHRRLTHRMKEA 
GVDHRQWRGP I LS TCKQCPWYPS PVCGSDGHTYS FQCKLE YQACVLGKQ I S VKCEGHCPCP 

SDKPTSTSRNVKRACSDLEFREVANRLRDWFKALHESGSQNKKTKTLLRPERSRFDTSILPI 
CKDSLGWMFNRLDTNYDLLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLISNNEWCYCFO 

™n? TELSNIQKRQGVKKLLGQYIPLCDEDGYYK ^ 

RINGVADCAIDFEISGDFASGDFHEWTDDEDDEDDIMNDEDEIEDDDEDEGDDDDGGDDHDVYI 



amino acids 1-16 



amino acids 24 6-267 



N-myristoylation sites. 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins 

amino acids 353-365 and 339-352 



A 
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FIGURE 180A 



CAGAC T C CAGAT TTCCCTGT CAACCACGAGGAGT CCAGAGAGGAAAC GC GGAGCGGAGACAA 

CAGTACCTGACGCCTCTTTCAGCCCGGGATCGCCCCAGCAGGGAIfiGGCGACAAGATCTGGC 

TGCCCTTCCCCGTGCTCCTTCTGGCCGCTCTGCCTCCGGTGCTGCTGCCTGGGGCGGCCGGC 

TTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTTCCCGCCGGCCAGAAGGAGTGCTT 

CTACCAGCCCATGCCCCTGAAGGCCTCGCTGGAGATCGAGTACCAAGTTTTAGATGGAGCAG 

GATTAGATATTGATTTCCATCTTGCCTCTCCAGAAGGCAAAACCTTAGTTTTTGAACAAAGA 

AAATCAGATGGAGTTCACACTGTAGAGACTGAAGTTGGTGATTACATGTTCTGCTTTGACAA 

TACATTCAGCACCATTTCTGAGAAGGTGATTTTCTTTGAATTAATCCTGGATAATATGGGAG 
AAC AG G C AC AAGAAC AAGAAGAT TG GAAGAAATAT AT T AC T GG CACAGAT AT AT T GGAT ATG 

AAACTGGAAGACATCCTGGAATCCATCAACAGCATCAAGTCCAGACTAAGCAAAAGTGGGCA 
C AT AC AAAT T C T G C T TAG AG CAT T T GAAGC T C GT GAT C GAAAC AT ACAAGAAAGC AAC T T TG 

ATAGAGTCAATTTCTGGTCTATGGTTAATTTAGTGGTCATGGTGGTGGTGTCAGCCATTCAA 
G T T TAT AT G C T GAAG AG T C T G T T TG AAGATAAGAGGAAAAG T AGAAC TI&ftAAC T C CAAAC T 
AGAG T AC G T AAC AT T GAAAAAT GAG G CATAAAAATG C AAT AAAC T G T T ACAG T CAAGACC AT 
TAAT GGTCTTCTC GAAAAT AT T T TGAGATATAAAAG TAG GAAACAGG TATAAT T T T AAT G TG 

AAAATTAAGTCTTCACTTTCTGTGCAAGTAATCCTGCTGATCCAGTTGTACTTAAGTGTGTA 

ACAGGAATATTTTGCAGAATATAGGTTTAACTGAATGAAGCCATATTAATAACTGCATTTTC 

CTAACTTTGAAAAATTTTGCAAATGTCTTAGGTGATTTAAATAAATGAGTATTGGGCCTAAT 

TGCAACACCAGTCTGTTTTTAACAGGTTCTATTACCCAGAACTTTTTTGTAAATGCGGCAGT 

TACAAATTAACTGTGGAAGTTTTCAGTTTTAAGTTATAAATCACCTGAGAATTACCTAATGA 

TGGATTGAATAAATCTTTAGACTACAAAAGCCCAACTTTTCTCTATTTACATATGCATCTCT 
C C T AT AA T G T AAAT AG AAT AAT AG C T T T GAAAT ACAAT TAG G T T T T T GAGAT T T T TAT AAC C 

AAATACATTTCAGTGTAACATATTAGCAGAAAGCATTAGTCTTTGTACTTTGCTTACATTCC 
C AAAAG C T GAC AT T T T C AC GAT T C T T AAAAACACAAAG T T AC AC T TAC T AAAAT TAGGACAT 

GTTTTCTCTTTGAAATGAAGAATATAGTTTAAAAGCTTCCTCCTCCATAGGGACACATTTTC 
T C T AAC C C T T AAC T AAAG T G TAG GAT T T TAAAAT T AAAT G T GAG G T AAAAT AAG T T TAT T T T 
T AAT AG TAT C T G T C AAG T T AAT AT C T G T CAACAG T T AAT AAT CAT G T TAT G T TAAT T T TAAC 
AT GAT T G C T GAC T T G GAT AAT T CAT TAT T ACCAG CAG T TAT GAAG GAAAT AT T G C T AAAATG 

ATCTGGGCCTACCATAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGAAAACAGG 
AAAGAAT T T AGAAAAAC T T G AGAAAACC TAAT C C AAAAT AAAAT T CAC T T AAG T AGAAC TAT 

AAATAAATATCTAGAATCTGACTGGCTCATCATGACATCCTACTCATAACATAAATCAAAGG 

AGATGATTAATTTCCAGTTAGCTGGAAGAAACTTTGGCTGTAGGTTTTTATTTTCTACAAGA 

ATTCTGGTTTGAATTATTTTTGTAAGCAGGTACATTTTATAAAATGTAAGCCCTACTGTAAG 

GTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATAACAACTTTTATTAAAATGG 
C C T T T C T G AAC ACTTTATTTATT GAT G T T GAAG T AAG GAT T AGAAACAT AGAC T C C GAAG T T 

TTAAACACCTAAATGTGAATAACCCATATATACAACAAAGTTTCTGCCATCTAGCTTTTTGA 
AGTCTATGGGGGTCT TAC T C AAG TAC TAG TAAT T TAAC T T CAT CAT G AAT G AAC TATAAT T T 
T T AAG T T AT G C C CAT T TAT AAC G T T G T T TATGAC TAC AT T G T GAG T T AGAAACAAAC T T AAA 

ATTTGGGGTATAGAACCCCTCAACAGGTTAGTAATGCTGGAATTCTTGATGAGCAATAATGA 
TAAC CAG AG AG T G AT T T CAT T T AGAC T C AT AGTAG TATAAAAAGAG AT AC AT TTCCCTCT TA 

GGCCCCTGGGAGAAGAGCAGCTTAGATTTCCCTACTGGCAAGGTTTTTAAAAATGAGGTAAA 
TGCCGTATATGATCAATTACCTTAATTGGCCAAGAAAATGCTTCAGGTGTCTAGGGGTATCC 
TCTGCAACACTTGCAGAACAAAGGTCAATAAGATCCTTGCCTATGAATACCCCTCCCTTTTG 
C GC TG T T AAAT T T G C AAT GAGAAGCAAAT T TACAGTAC CAT AAC TAATAAAGCAGGGT AC AG 

ATATAAACTACTGCATCTTTTCTATAAAACTGTGATTAAGAATTCTACCTCTCCTGTATGGC 
TGTTACTGTACTGTACTCTCTGACTCCTTACCTAACAATGAATTTGTTACATAATCTTCTAC 
ATG TAT GAT T T G T G C CAC T GAT C T TAAAC C T ATG AT T CAG TAAC T T C T T ACC AT AT AAAAAC 
GAT AAT T G C T T TAT T T G GAAAAGAAT T T AGGAATAC T AAGG AC AAT TAT T T T T AT AGACAAA 
G TAAAAAGACAGATAT T T AAG AGG CATAAC CAAAAAAG C AAAAC T T G TAAAC AGAGTAAAAA 
TCTTTAATATTTCTAAAGACATACTGTTTATCTGCTTCATATGCTTTTTTTAATTTCACTAT 
TCCATTTCTAAATTAAAGTTATGCTAAATTGAGTAAGCTGTTTATCACTTAACAGCTCATTT 
TGTCTTTTTCAATATACAAATTTTAAAAATACTACAATATTTAACTAAGGCCCAACCGATTT 
CCATAATGTAGCAGTTACCGTGTTCACCTCACACTAAGGCCTAGAGTTTGCTCTGATATGCA 
TTTGGATGATTAATGTTATGCTGTTCTTTCATGTGAATGTCAAGACATGGAGGGTGTTTGTA 
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FIGURE 18QB 

ATTTTATGGTAAAATTAATCCTTCTTACACATAATGGTGTCTTAAAATTGACAAAAAATGAG 
CACTTACAATTGTATGTCTCCTCAAATGAAGATTCTTTATGTGAAATTTTAAAAGACATTGA 

TTCCGCATGTAAGGATTTTTCATCTGAAGTACAA.TAATGCACAATCAGTGTTGCTCAAACTG 
C T T TAT AC T T AT AAAC AG C CATC T T AAATAAG GAACG TAT T G T GAG TAC T GAT AT G TAT AT A 
ATAAAAAT TAT CAAAGGAAAA 
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FIGURE 181 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA52196 

xsubunit 1 of 1, 229 aa, 1 stop 
XMW: 26017, pi: 4.73, NX(S/T): 0 

MGDKIWLPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEIEY 

QVLDGAGLD I DFHLAS PEGKTLVFEQRKS DGVHTVETEVGDYMFC FDNT FS T I SEKVI FFEL 
ILDNMGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQILLRAFEARDRN 

I QE S NFDRVNFW SMVNLWMVWS AI QVYMLKS L FE DKRKSRT 
Important features: 



amino acids 1-23 

Transmembrane domain : 

amino acids 195-217 

N-myristoylation site. 

amino acids 43-48 

Tyrosine kinase phosphorylation site. 

amino acids 55-62 
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CCATCCCTGAGATCTTTTTATAAAAAACCCAGTCTTTGCTGACCAraraaa 

C T C AC C AG AG AG T C G C AGAC AC T^JgCTG C C T CC CAT G G^C C^TG^C^AG^G TGTr ^^^^ 

CTGCTTTCCTGCCTCATTCTrrTrTr^ ^^CAGTGTGTCCTGGATG 

TTTTGTCACCAAAATCCTGGATGr J-ATGGCTCCCCCTGCTATGCCTTGT 



TTCTTTTTCCTTTTTCTTCACCTTCAT^^GGCTTTTCTCTGTCTTCCAT 
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FIGURE 183 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56965 
<subunit 1 of 1, 175 aa, 1 stop 
<MW: 19330, pi: 7.25, NX(S/T): 1 

MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWM 
DADLACQKRPSGKLVSVLSGAEGSFVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGWEWSS 
TDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD 

Important features: 
Signal peptide: 

amino acids 1-26 

C-type lectin domain signature. 

amino acids 146-171 
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CCAGTCTGTCGCCACCTCACTTGGTGTCTGCTGTCCCCGCCAG 




GCAAGCCTGGGGTGAGAGC 
TGGCGCTGGTGCTGGC 

T 



CGGACTGTGTCACCAICGCCACCTGCACCACCSACGAAACCATGTGCAAGACCACACTCTAC 

GTGTAAGCCCTCGGATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCTGCTGCAATACTG 
AGCTGTGCAATGTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTC 




GCCCCGAATGCCTTGAAGAAGTGCCCCCTGCACCAGGAAAAAAAAAAAAA^ 1 
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FIGURE 185 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA56405 
<subunit 1 of 1, 125 aa, 1 stop 
<MW: 13115, pi: 5.90, NX(S/T): 1 

MRGTRLALLALVLAACGELAPALRCYVCPEPTGVSDCVTIATCTTNETMCKTTLYSREIVYP 
FQGDSTVTKSCASKCKPSDVDGIGQTLPVSCCNTELCNVDGAPALNSLHCGALTLLPLLSLRL 

Important features : 
Signal peptide: 

amino acids 1-17 

N-glycosylation site . 

amino acids 46-49 
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FIGURE lap 



ACGGTTTCGTGGGGACCCAGGCTTGCaAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTT 




TACCCGGGTCTTTGTCGCGATGG 
TAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTC 
AATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTC 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 

CCCACCCGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 
CATAGCAGCTTGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACC^ 
AGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTA 
GACACTTCTGGICCAAGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCAT 
AGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCI^C 
TTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGA 




TCCTTTTATATAATAGATGCTATGAAAAi 



CC 



TTTTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGITAAGCAT 
TCCAATAACACCTTCCAAAAACCTGGAGTGIAAGAGCTTTGTTTCTTTATGGAACTCCCCTG 
TGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGA 

AACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTT 
TTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATT 
TCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAG^ACGCA 

ag^tctcitggaatgacaaatgataggtacctaaaatgtaacaigaaaatacta^tat" 

TTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGA 
AATAABATTTAACATTTAAAAAAAAAAAAA wAlAGTTTTTGA 
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FIGURE 1B7 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA57530 
<subunit 1 of 1, 266 aa, 1 stop 
<MW: 28672, pi: 8.85, NX (S/T) : 1 

MMALGAAGATRVFVAMVAAALGGHPLLGVSATLNSVLNSNAIKNLPPPLGGAAGHPGSAVSA 
APGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQICLACRKRRKRCMRH 
AMCCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEG 

SVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCGEGLSCRIQ 
KDHHQASNS SRLHTCQRH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 256-259 

Fungal Zn(2)-Cys(6) binuclear cluster domain 

amino acids 110-126 
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FIGURE 188 

TGTGTTTCCCTGCAGTCAGAATTTGGGACNGCAGGGGTTCCCGGACCTGATTTTGCAGCGGA 
ACGGGAAGGTTTTGTGGGACCCAGGTTGAAATGACGGTCATTTTTTTTTCTTTCTCCTTCNG 
GAGTCCTTNTGAGANGATGGTTTTGGGCGCAGCGGGAGCTAACCCGGTTTTTTGTNGCGATG 
GTAGCGGCGGTTTTCGGCGGCCACCTTNTGCTGGGAGTGAGCGCCACCTTGAATCGGTTTTC 
AATTCCAACGNTATCAAGAACCTGCCCCCACCGNTGGGCGGCGCTGCGGGGCACCCAGGNTT 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ATTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGANGCGGGCGTGCAAATNTGTNTNGCCTGCAGGAAGCGCCGAAAACG 

CTGCATGCGTCANGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTNTT 
C T GAT C AAAAT C AT T T C C G AG G AGAAA.T T GAG GAAAC CAT C AC T G AAAG C T T T G G T AAT GAT 
CATAGCACCTTGGATGGG 
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CATTCACAGATTflA?AT?T"GSGAS^?^S™I A ^ G AGC^S? 



T C AGAGAC T T GAAAAG GAT CAT TC TCT G T T TTCT r a t a t r- -r * ^^^AAGAT GAC TATA 

^catggcgattttttaaaaaVa^ 

i.C TCAAAGATGAAGATC TACGG 
^. iiWV3i ,„,.„............„„___ .TCAAGGTGGCCAGTATGAAAG 

AGAACTTTGGACAGGAAATTTTGT^r^ 




TTATGAATCCTTCAGTGGTCTCOTTTA^ 




ggatcagggcctatccaactgttaagtU¥sS^S;™ gctcagacatgcca saaagctg 

GAAGAGCAGATAAATACCAGAGATVraBaa^jrrS^^^^^^^^^^^^^^AAATTTTCAA 

wctctccgaaatcaaSga^^^ 




.TG 

ACTTGTTCTCCTAAAAATGCTATCCCTAACCA^a^ 

tgatgtggcacagtaaacaaIccctg^S 

TTCTTTCCTTC^^ 
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CAGTATATGTGCACACAGTAAGTACACAAATTTGAGCAACAGTAAGTGCACAAATTCTGTAG 
TTTGCTGTATCATCCAGGAAAACCTGAGGGAAAAAAATTATAGCAATTAACTGGGCATTGTA 

GAG TAT C C T AAAT AT G T TAT CAAG TAT T T AGAGT T C TATAT T T T AAAGAT ATAT GT GT T CAT 
GTATTTTCTGAAATTGCTTTCATAGAAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTA 
ATATTTACATATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTT 
TTCTTTCATAGTTTTGGTTTTTCACTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAAT 

tactttacaggttgttttactgtagcttataatgatactgtagttattccagttactagttt 

ac t g t c ag ag ggctgccttttt cagat aaat at t gacat aat aac t gaag t tat t t t t at aa 
gaaaatcaagtatataaatctaggaaagggatcttctagtttctgtgttgtttagactcaaa 
gaatcacaaatttgtcagtaacatgtagttgtttagttataattcagagtgtacagaatggt 
aaaaattccaatcagtcaaaagaggtcaatgaattaapaggcttgcaactttttcaaaaaaa 

aaaaaaaaaa 
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</usr/ seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56439 
<subunit 1 of 1, 747 aa, 1 stop 
<MW: 86127, pi: 7.46, NX(S/T): 2 

MGVWLNKDDYIRDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKLALKL 
HPDKNPNNPNAHG D FLK I NRAYEVLKDE DLRKKYDK YGEKGLE DNQGGQ YE S WNY YR YD FG I 
YDDDPE 1 1 T LERRE FDAAVNS GELWFVNFYS PGCSHCHDLAPTWRDFAKEVDGLLRI GAVNC 
GDDRML CRMKGVNSYPSLFIFRS GMAPVK YHGDRS KE S LVS FAMQHVRS TVTELWTGNFVNS 

IQTAFAAGIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHNLPDFELLSAN 

TLEDRLAHHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQP 

SLAVFKGQGTKEYEIHHGKKILYDILAFAKESVNSHVTTLGPQNFPANDKEPWLVDFFAPWC 

PPCRALLPELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTTWFNQSNIHEYEGHHS 
AE QILEFIED LMN P S W S L T P T T FNE L VT QRKHNE VWMVD F YS P W CH PC QVLMPE WKRMART 

LTGLINVGSIDCQQYHSFCAQENVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLG 
FLPQVS TDL TPQT FS EKVLQGKNHWVI DFYAPWCGPCQNFAPE FELLARMI KGKVKAGKVDC 

QAYAQTCQKAGIRAYPTVKFYFYERAPCRNFQEEQINTRDAKAIAALISEKLETLRNQGKRNKDEL 
Important features: 

Endoplasmic reticulum targeting sequence. 

amino acids 744-74 7 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature . 

amino acids 77-96 

N-glycosylation site. 

amino acids 484-487 
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FIGURE 191 

AGAC AG T AC CTCCTCCC TAGGAC TACACAAGGAC TGAACCAGAAGGAAGAGGACAGAGCAAA 
GCC^ISAACATCATCCTAGAAATCCTTCTGCTTCTGATCACCATCATCTACTCCTACTTGGA 
GTCGTTGGTGAAGTTTTTCATTCCTCAGAGGAGAAAATCTGTGGCTGGGGAGATTGTTCTCA 
T TAC T GGAGCT GGGCAT GGAATAGGCAGGCAGACTACTTATGAAT TTGCAAAACGACAGAGC 
ATATTGGTTCTGTGGGATATTAATAAGCGCGGTGTGGAGGAAACTGCAGCTGAGTGCCGAAA 
AC TAG G C G T C AC T G C G CAT G C G TAT G T G G T AGAC T G CAGC AACAGAGAAGAGATC T ATC GCT 

CTCTAAATCAGGTGAAGAAAGAAGTGGGTGATGTAACAATCGTGGTGAATAATGCTGGGACA 
GT AT AT C C AG C C GAT C T T C T C AG C AC CAAG GAT G AAGAGAT TAC C AAGACAT T T GAG G T CAA 
CAT C C T AG G AC AT TTTTGGAT C AC AAAAG C AC T T C T T C CAT C GAT GAT G G AGAGAAAT CAT G 

GCCACATCGTCACAGTGGCTTCAGTGTGCGGCCACGAAGGGATTCCTTACCTCATCCCATAT 

TGTTCCAGCAAATTTGCCGCTGTTGGCTTTCACAGAGGTCTGACATCAGAACTTCAGGCCTT 

GGGAAAAACTGGTATCAAAACCTCATGTCTCTGCCCAGTTTTTGTGAATACTGGGTTCACCA 

AAAATCCAAGCACAAGATTATGGCCTGTATTGGAGACAGATGAAGTCGTAAGAAGTCTGATA 

GATGGAATACTTACCAATAAGAAAATGATTTTTGTTCCATCGTATATCAATATCTTTCTGAG 

ACTACAGAAGTTTCTTCCTGAACGCGCCTCAGCGATTTTAAATCGTATGCAGAATATTCAAT 
T T G AAG CAGTGGTTGGC C AC AAAAT C AAAAT G AAAT£&AT AAAT AAG C T C CAG C C AGAGAT G 

TATGCATGATAATGATATGAATAGTTTCGAATCAATGCTGCAAAGCTTTATTTCACATTTTT 
T CAG T C C T GATAATAT TAAAAACATT GGT T TGGCAC TAGC AGCAG T CAAACGAACAAGAT TA 

ATTACCTGTCTTCCTGTTTCTCAAGAATATTTACGTAGTTTTTCATAGGTCTGTTTTTCCTT 

T C AT G C C T C T T AAAAAC T T C T G T G C T TACATAAACAT AC T T AAAAGG T T T T C T T TAAGAT AT 

TTTATTTTTCCATTTAAAGGTGGACAAAAGCTACCTCCCTAAAAGTAAATACAAAGAGAACT 

TATTTACACAGGGAAGGTTTAAGACTGTTCAAGTAGCATTCCAATCTGTAGCCATGCCACAG 
AAT AT C AAC AAGAAC AC AG AAT GAG T G C AC AG C T AAG AG AT CAAG T T T C AG CAG G CAG C T T T 

ATCTCAACCTGGACATATTTTAAGATTCAGCATTTGAAAGATTTCCCTAGCCTCTTCCTTTT 

TCATTAGCCCAAAACGGTGCAACTCTATTCTGGACTTTATTACTTGATTCTGTCTTCTGTAT 

AACTCTGAAGTCCACCAAAAGTGGACCCTCTATATTTCCTCCCTTTTTATAGTCTTATAAGA 

TACATTATGAAAGGTGACCGACTCTATTTTAAATCTCAGAATTTTAAGTTCTAGCCCCATGA 

TAACCTTTTTCTTTGTAATTTATGCTTTCATATATCCTTGGTCCCAGAGATGTTTAGACAAT 
T T TAG GCT C AAAAAT T AAAG C T AACACAGGAAAAGGAAC T G TAC T GG C TAT T AC AT AAGAAA 
CAATGGAC C CAAGAGAAGAA 
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FIGURE IflP 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56409 
<subunit 1 of 1, 300 aa, 1 stop 
<MW: 33655, pi: 9.31, NX(S/T): 1 

MNI I LE I LLLL I T 1 1 YS YLESLVKFFIPQRRKSVAGEIVLI TGAGHGIGRQTTYEFAKRQS I 
LVLWDINKRGVEETAAECRKLGVTAHAYWDCSNREEIYRSLNQVKKEVGDVTIWNNAGTV 
YPADLLS TKDEE I TKT FEW I LGHFWI TKALLPSMMERNHGHI VTVASVCGHEG I P YL I PYC 

SSKFAAVGFHRGLTSELQALGKTGIKTSCLCPVFVNTGFTKNPSTRLWPVLETDEWRSLID 
GIL TNKKM I FVP S Y I N I FLRLQKFL PE RAS AI LNRMQN I Q FE AWGHKI KMK 

Important features : 
Signal peptide: 

amino acids 1-19 

CAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-33 and 58-61 

Short-chain alcohol dehydrogenase family protein 

amino acids 165-202, 37-49, 112-122 and 210-219 
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FIGURE 1Q3 

CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGG 

ATGCCCCGGCTCTGCAGGAAGCTGAAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGC 

AGG^IfiACCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCT 

GCTGATCATCGTGTACTGGGACAGCGCAGGCGCCGCGCACTTCTACTTGCACACGTCCTTCT 

CTAGGCCGCACACGGGGCCGCCGCTGCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACG 

GCCGACTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGA 

CCTTCCCAGAAAGGAGACGGAGCAGCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAG 

GCTACGACTGGTCCCCGCGCGACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAG 

CGGAGGAGCGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCG 

CGCATTCGACGACATCCCCAACTCGGAGCTGAGCCACCTGATCGTGGACGACCGGCACGGGG 

CCATCTACTGCTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTG 

AGCGGAAGCCTGCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCA 

CGTGCACAACGCCAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCT 

CCCGCCACCTCATGAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTGGTGCGCGACCCC 

TTCGTGCGCCTGATCTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCG 

CAAGTTCGCCGTGCCCATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGC 

GCGAGGCCTTCCGCGCTGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGAC 

CCGCACACGGAGAAGCTGGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCA 

CCCGTGCCAGATCGACTACGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGACGCCGCGC 

AGCTGCTGCAGCTACTCCAGGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGG 

ACCGCCAGCAGCTGGGAGGAGGACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAGCT 

GTATAAACTCTACGAGGCCGACTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCC 

GAGAC3^AAGCTTTCGCGTTGCTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCC 

AGTTTTTTTATGACCTACGATTTTGCAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCC 
AT T GAG T AC T G TAT C GAT AT T G T T T T T T AAG AT T AAT AT AT T T C AG G TAT T T AAT AC G A 
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FIGURE 194 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA56112 
<subunit 1 of 1, 414 aa, 1 stop 
<MW: 48414, pi: 9.54, NX(S/T): 4 

MTKARLFRLWLVLGSVFMILLIIVYWDSAGAAHFYLHTSFSRPHTGPPLPTPGPDRDRELTA 
DSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAER 
RS VLRGFCANS S LAFPTKERAFDDI PNSELSHL IVDDRHGAI YC YVPKVACTNWKRVMIVLS 
GSLLHRGAPYRDPLRIPREHVHNASAHLTFNKFWRRYGKLSRHLMKVKLKKYTKFLFVRDPF 

VRL I SAFRSKFELENEE FYRKFAVPMLRL YANHTSLPASAREAFRAGLKVS FANFI QYLLDP 
HTEKLAPFNEHWRQVYRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRT 

AS S WEE DW FAKI P LAWRQQL YKL YEAD FVL FG Y PKPENLLRD 

Important features : 
Signal peptide : 

amino acids 1-31 

N-glycosylation sites . 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cysteine-rich region protein 

amino acids 329-332 
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ATGCCGGGSAGGTGGTGGTCGTGACCGGGSGCGGGCGCGGCATrrrz,^^ """^CGCT 

GCCTTCGTGAACAGCGGGGCCCGflGTGGTTATCTGCGACAAGGATG^CTCTGGKGCCr^^ 
CCTGGAGCAGGAGCTCCGTGGAGCTGICTTTATCCTCTGTGA^f ^ 




====== 



WO 99/46281 





PCT/US99/05028 



FIGURE 196 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA56045 
<subunit 1 of 1, 270 aa, 1 stop 
<MW: 28317, pi: 6.00, NX(S/T): 1 

MAT G T R Y AGKVVWT GGGRG I GAG I VRAFVNS GARWI C DKDE S GGRALEQELPGAVFILCD 
VTQEDDVKTLVSETIRRFGRLDCWNNAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTKL 
ALPYLRKSQGNVINISSLVGAIGQAQAVPWATKGAVTAMTKALALDESPYGVRVNCISPGN 
IWTPLWEELAALMPDPRATIREGMLAQPLGRMGQPAEVGAAAVFLASEANFCTGIELLVTGG 
AELGYGCKASRSTPVDAPDIPS 

Important features: 
N-glycosylation site . 

amino acids 138-141 

Short-chain alcohol dehydrogenase family protein 

amino acids 10-22, 81-91, 134-171 and 176-185 
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FIGURF 197 



AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGfl^ACTGGCCTCACAACCTG 
CTGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAAAAGCAA 

IZtcZcZr ^ CCGTATGCCCG ™ G ^ T ^^=AACATCGAGGAGATG 
GTGGCCCAGCTGAGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCT 
GTGGATGICCAACAAGAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCC 
GTATCCCCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACC 
ATGCAGGAGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCG 
CCTCTGCCGGCCACCGCCCCGCACAGGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCG 




lGAAG C C AG G C C AG C AG C C C GAGA 
CCATGCTCCTTGCACCTTTGTGCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAA 

bLAAG 
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FIGURF 19« 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA59294 
<subunit 1 of 1, 180 aa, 1 stop 
<MW: 20437, pi: 9.58, NX(S/T): 1 

MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEY 
ERNIEEMVAQLRNSSELAQRKCEVNLQLWMSNKRSLSPWGYSINHDPSRIPVDLPEARCLCL 
GCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 75-78 

Homologous region to IL-17 

amino acids 96-180. 
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FIGURE 199 

GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGG 
CGAGCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCG 
GCGCCCAAC^ISGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGAT 
CGCGGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGG 
TCCAGCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGATGCTGAAATTT 
TACGCCCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGCTTTTGCAAAGAA 
TGGTGAAATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTG 
GCCGCTTCTTTGTCACCACTCTCCCAGCATTTTTTCATGCAAAGGATGGGATATTCCGCCGT 
TATCGTGGCCCAGGAATCTTCGAAGACCTGCAGAATTATATCTTAGAGAAGAAA.TGGCAATC 
AGTCGAGCCTCTGACTGGCTGGAAA.TCCCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTC 
TTTTTAGCATCTCTGGCAAGATATGGCATCTTCACAACTATTTCACAGTGACTCTTGGAATT 
CCTGCTTGGTGTTCTTATGTGTTTTTCGTCATAGCCACCTTGGTTTTTGGCCTTTTTATGGG 
TCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGTGCCACTTCCAA.GGCATTTATCTGAGC 
GTTCTGAGCAGAATCGGAGATCAGAGGAGGCTCATAGAGCTGAACAGTTGCAGGATGCGGAG 
GAG GAAAAAG AT GAT T C AAAT GAA.G AA.G AAAAC AAA.GAC AG C C T T G T AGAT GAT GAAGAAGA 

GAAAGAAGATCTTGGCGATGAGGATGAAGCAGAGGAAGAAGAGGAGGAGGACAACTTGGCTG 
CTGGTGTGGATGAGGAGAGAAGTGAGGCCAATGATCAGGGGCCCCCAGGAGAGGACGGTGTG 
ACCCGGGAGGAAGTAGAGCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGC 
TGACACAGAGGTGGTGGAAGACTCCTTGAGGCAGCGTAAAA.GTCAGCATGCTGACAAGGGAC 
T G3&SAT T T AAT GAT G C G T T T T C AAGAAT AC AC AC C AAAAC AAT AT G T C AG CTTCCCTTTGG 

CCTGCAGTTTGTACCAAATCCTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCT 

CTCTAGTCATTTGGTCTCATGGCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGT 

GACAATCAGGATATAGAAAAACAAACGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAA 
C AAG T T CAT T T AC T TAG G G G T C AGAGAG TC T C G AC C AGAGGAG G C CAT T C CC AG T C C T AAT C 

AGCACCTTCCAGAGACAAGGCTGCAGGCCCTGTGAAATGAAAGCCAAGCAGGAGCCTTGGCT 
CCTGAGCATCCCCAAAGTGTAACGTAGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTAT 
TTTTGTCAAATTGCAGGAAACATCAGGCACCACAGTGCATGAAAAATCTTTCACAGCTAGAA 
ATTGAAAGGGCCTTGGGTATAGAGAGCAGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTG 
TGCTATGTTTTATTTCTTACCTTTAATTTTTCCAGCATTTCCACCATGGGCATTCAGGCTCT 
C C AC AC T C T T C AC TATTATCTCTTGGT C AGAGG AC T C C AAT AAC AG C C AG G T T T AC AT GAAC 

TGTGTTTGTTCATTCTGACCTAAGGGGTTTAGATAATCAGTAACCATAACCCCTGAAGCTGT 
G AC T G C C AAAC AT C T C AAAT GAAAT GTTGTGGC CAT C AG AG AC T C AAAAG GAAG T AAG GAT T 

T T AC AAG AC AG AT T AAAAAAAAAT T G T T T T G T C CAAAAT AT AG T T G T T G T TG AT T T T T T T T T 

AAGTTTTCTAAGCAATATTTTTCAAGCCAGAAGTCCTCTAAGTCTTGCCAGTACAAGGTAGT 

CTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATCTCAAGGGGTTCCCTGGGTCTTGAAC 

TACTTTAATAATAACTAAAAAACCACTTCTGATTTTCCTTCAGTGATGTGCTTTTGGTGAAA 

GAATTAATGAACTCCAGTACCTGAAAGTGAAAGATTTGATTTTGTTTCCATCTTCTGTAATC 

TTCCAAAGAATTATATCTTTGTAAATCTCTCAATACTCAATCTACTGTAAGTACCCAGGGAG 
GCTAATTTCTTT 
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FIGURE 200 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ ss .DNA564 33 
<subunit 1 of I , 349 aa, I stop 
<MW: 38952, pi: 4.34, NX(S/T): 1 

MAGGRCGPQLTALLAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWMLKFYAP 
WCPSCQQTDSEWEAFAKNGEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFRRYRG 
PGIFEDLQNYILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFTVTLGIPAW 
CSYVFFVIATLVFGLFMGLVLWISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEK 
DDSNEEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQGPPGEDGVTRE 

EVEPEEAEEGISEQPCPADTEWEDSLRQRKSQHADKGL 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain : 

amino acids 191-211 

N-glycosylation site . 

amino acids 4 6-49 

Thioredoxin family proteins. (homologous region to disulfide 

isomerase) 

amino acids 56-72 

Flavodoxin proteins 

amino acids 173-187 
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E1GURIL2Q1 



CAAAT G C TAT AT C TAT T C AG G GGCT C TC AAGAAr A a E J^^Jf ^^^*^*^AAGAG C TC C 
T T G GAT G AAG AT G GAT A T A C T CAAT TACAC T TP r att^t^^ C AT C C T GAT T T AGAAAAT 

TGTTTCAGAGAAAGGATCGTG^ 
(^TCCTATGCTTGGTAATArJr^ 

CTC T T C TAAC T TAT T T CAGATCAGAACCAr Ar r t a ^J^^ TCTGGGAGGATGGATCAACAT T 



aggaggacagaaaacagaaHgaaaagagtIIca^ 

T T T AGAGAGC T T G G C CAAC T G TAATC T TA A rr a a a ^AAGATAAATGCAGAAAAT G 

ATCTT^TTTCTGOTCTATCA^ 
CCCAAGTAGCTGGGATTACAGTCAG^ 
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FIGURE 207 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53 912 
<subunit 1 of 1, 201 aa, i stop 

<MW: 22563, pi: 4.87, NX(S/T): 1 

MEYHPDLENLDEDGYTQLHFDSQSNTRIAWSEKGSCAASPPWRLIAVILGILCLVILVIAV 

VLGTMGVLSSPCPPNWI IYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQ 

VSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQL 
CSVPSYSICEKKFSM 

Important features : 

Type II transmembrane domain: 

amino acids 4 5-65 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 197-200 

N-myristoylation sites. 

amino acids 35-40 and 151-156 

Homologous region to LDL receptor 

amino acids .34-67 and 70-200. 
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FIGURE 7r>3A 

ACCGACATGCTGGGAGATTACATCTTCTCTATGGCAAGCGT^ACTTCTTGCTGAGT^ACAAA 
GCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCT^ 

GAGGCCCTCGGCTGCCCCCGCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAA^ 



AAGAATTCCAGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACCAAAGT 
^S^ CCTCACGGAGCCCGTGGTGCTCACTT TCCAGCACCAGCTACAG^ 

CTCTGCAATGTGTGTTCTGGGTTGAAGACCCC^TTGAGCAGCC^GCMT^S^ 
GCTGGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGCAAC^ 

TCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCPTAr 

ctctgctccagggtgcccctgccgtgc^gaggamctcgg^tS^?^sSg™ 

CATAGGACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTG^ 
CTACATCACCAACCTGGGCCTCTTCAGCCTGGTGTTTCTGTTCAACATG^ 

C^GGGCcIc^ 

TCATCTTCATCTGGTMTGGTCCATGCGGCTGCAGGCCC^^Scc2?c?^S^ 

AGCAACTCAGACAGCGCCAGGCTCCCCATCAGCTCGGGCAGCACCTCGTCCAGCC^ 

SGCCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATGCGGCCTCGTCGCACACTGCCTGT 

ggcccccgagccaggcccagccccaggccagtcagccgcagactttggaaS 

ATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCCGGGCTGGGC^GA^ 

ggggactactcggctctcactcagctcccacgggactc^gaagtgc^ 

GGGTACTGTCCCCACATCTGTCCCAACCCAGCTGGAGGCCTGGTCTCTCCTTACAA^ 

GGCCCAGCCCTCATTGCTGGGGGCCAGGCCTTGGATCTTGAGGGTCTGGCACATC^TT 
CTGTGCCCCTGCCTGGGACM^^ 

GGGCACTCTGCATCCTCTGTCATTTTAACCTCAGGTGGCACCCAGGGCGAATG^ 

^IES AATGCTGTTTTTT GGGGAGTGGTTTCCAGGAGCTGC^ 
IH?^I A ^ GG ^ AAGCCTCGGCCTGCCCCTGA GCCA^ 

GCTAGGTCCCTCTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTGCTCACCCTrArr 
AAGCACACGCCTCAGAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCA^ 
ACCATGCCAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACT 
GTGACACTGGCCTAGAGCCTGACACTCTCCTAAGAGGTTCTCl^CA^^^^CC^^SlA^A^C^: 
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FIGURE 203B 



CAGGCGCCCTCGGCCGCCCATCATGGTTAATTCTGTCCAACAAACACACACGGGTAGATTGC 
TGGCCTGTTGTAGGTGGTAGGGACACAGATGACCGACCTGGTCACTCCTCCTGCCAACATTC 
AGTCTGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTACAGGGACAGGGAGCCAT 
CATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGACCTTGA 
AGATGGGAAGGATGTTCTTTTTACGTACCAATTCTTTTGTCTTTTGATATTAAAAAGAAGTA 
CATGTTCATTG T AGAG AAT T T GGAAAC TG T AGAAGAGAATC AAGAAGAAAAAT AAAAAT C AG 
C T GT T G T AAT C G C C T AG C AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAPAAAAAAAAAAAAAAA 
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FIGURE ?OZL 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50921 
<subunit 1 of 1, 693 aa, 1 stop 

<MW: 77738, pi: 8.87, NX (S/T) : 7 

MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSE 
EALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQH 
QEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLL 
SQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKLQPTAGLQD 
LHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGE 

PGHWSSAGCETVRRE 
IAAYLCSRVPLPC 

RRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAIFLHFSLLTCLSWMGLE 
GYNLYRLWEVFGTYVPGYLLKLSAMGWGFPI FLVTLVALVDVDNYGP I ILAVHRTPEGVIY 

PSMCWIRDSLVSYITNLGLFSLVFLFNMAMLATMWQILRLRPHTQKWSHVLTLLGLSLVLG 

LPWALIFFSFASGTFQLWLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLP 
ISSGSTSSSRI 



KVLGIWQNTKVANLTEPWLTFQHQLQPKNVTLQCVFWVEDPTLSS 
TQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCWSALACLVT 



Important features 



amino acids 1-25 



amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 



Microbodies C-terminal targeting signal. 

amino acids 691-693 

CAMP- and cGMP-dependent protein kinase phosphorylation sd 

amino acids 198-201 and 370-373 

N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors family 2 proteins 
amino acids 475-504 
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TGCCTGGCCTGCCTTGTCAACAATGCCGCTTACTCTGCTTCCAGGTTGCCCTGCCTTGCAGA 

GACACGAGCTTCCTGCTCAGCGNAGCCGGTGGCCCTGACAGGCTCTGAAGGCTGGCTGCCGA 
GCCAGTGCCATCTTCCTGCACTTCTCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGG 

tacaacctctaccgactcgtggtggaggtgtttggcacgtatgtccctggctLc™ 

GCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATG 

tggacaacxatggcgccatgatgttggctgtgcataggactgcagagggggtcIt" 

TCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCT 
GGTGTTTCTGTTCAACATGG LTTCBCCCT 
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FIGURE 206 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTCCA 

GGTTTTGCTTTGATCCTTTTCAAAAACTGGAGACACAGAAGAGGGCTCTAGGAAAAAGTTTT 

GGATGGGATTATGTGGAAACTACCCTGCGATTCTCTGCTGCCAGAGCAGGCTCGGCGCTTCC 

ACCCCAGTGCAGCCTTCCCCTGGCGGTGGTGAAAGAGACTCGGGAGTCGCTGCTTCCAAAGT 

GCCCGCCGTGAGTGAGCTCTCACCCCAGTCAGCCAA4SSAGCCTCTTCGGGCTTCTCCTGCT 

GACATCTGCCCTGGCCGGCCAGAGACAGGGGACTCAGGCGGAATCCAACCTGAGTAGTAAAT 

T C C AG T T T T C C AG C AAC AAG GAAC AGAAC GG AG TACAAGAT C C T C AG CAT GAG AGAAT TAT T 

AC T G T G T C T AC T AAT GGAAG TAT T C AC AG C C C AAGG T T T C C T CAT AC T TAT C C AAGAAAT AC 

GGTCTTGGTATG GAGAT TAG TAG C AGT AGAGG AAAAT G T AT G GAT AC AAC T T AC G T T T GAT G 

AAAGAT TTGGGCTT G AAG AC C C AGAAG AT G AC AT AT G C AAG TAT GAT T T T G TAGAAG T T GAG 

GAACCCAGTGATGGAACTATATTAGGGCGCTGGTGTGGTTCTGGTACTGTACCAGGAAAACA 

GAT T T C T AAAG G AAAT C AAAT T AGG AT AAGAT T T G T AT C T GAT G AAT AT T T T C C T T C T GAAC 

CAGGGTTCTGCATCCACTACAACATTGTCATGCCACAATTCACAGAAGCTGTGAGTCCTTCA 

GTGCTACCCCCTTCAGCTTTGCCACTGGACCTGCTTAATAATGCTATAACTGCCTTTAGTAC 

C T T GGAAG AC C T TAT T C GAT AT C T T GAAC C AG AGAGAT G GC AGT TGGAC T TAGAAG AT C TAT 

ATAGGCCAACTTGGCAACTTCTTGGCAAGGCTTTTGTTTTTGGAAGAAAATCCAGAGTGGTG 

GAT C T GAAC C T T C T AAC AG AG GAG G T AAG AT TAT AC AG C T G C AC AC C T C G T AAC T T C T C AG T 

GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTA 

AACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCA 

AGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGG 

ATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCA 

GAGGGAGCACAGGAGGAmSCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGC 

AGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTT 

C AAG G AC C T T T CAT C T T C AG GAT T TAC AG T G CAT T C T GAAAGAGGAGAC AT C AAACAGAAT T 

AGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATC 

G T G G AAAG AAAA T T AAAT G T T G TAT T AAAT AG AT C AC CAG C TAG T T T C AG AG T TAC CAT G T A 

CGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGT 

ACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 

ATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCTCATATTCAC 

AT AT G T AAA C CAG AAC AT T C TAT G TAC T AC AAAC CTGGTTTT T AAAAAG GAAC TAT G T T G C T 

ATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAAT 

TTCTGCCATT TAG AAG AAG AG AAC TAC AT T CAT G G T T T G GAAG AG AT AAAC C T G AAAAG AAG 

AGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTAT 

ATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAA.TATATCTATTTTTAC 

C AAAG G T AT T T AA T AT T C T T T T T T AT G AC AAC T T AG AT C AAC T AT T T T T AG C T T GG T AAAT T 

T T T C T AAAC AC AAT T G T TAT AG C CAGAG GAAC AAAGAT GAT AT AAAAT AT TGTTGCTCT GAC 

AAAAAT AC AT G TAT T T CAT T C T C G TAT G G T G C T AG AGT T AGAT T AAT C T GCAT T T T AAAAAA 

C T G AAT T G GAAT AG AAT T G G T AAG T T G C AAAG AC T T T T T GAAAAT AAT T AAAT TAT CAT AT C 

T T C CAT T C C T G T TAT T GG AG AT GAAAAT AAAAAG C AAC T TAT G AAAG TAG AC AT T C AGAT CC 

AGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGC 

ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACAT 

CCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACTTGTATAAAT 

AC AT G GAT AT T T T TAT G TAC AGAAG TAT G T C T C T TAAC CAG T T C AC T TAT T G TAC T C T G GC A 

AT T T AAAAG AAAAT CAG T AAAAT AT TTTGCTTG T AAAAT GC T T AAT ATNGT G C C TAG G T TAT 

GTGGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGAATGTGGCTATTTTGG 

GGAGAAAAT T AAAAAAAAAAAAAAAAAAAAAG G T T TAG GG AT AAC AGG G T AAT G C GG C C 
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FIGURE 707 

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGS IHSPR 
FPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPEDDICKYDFVEVEEPSDGTILGRWC 
GSGTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLL 
NNAITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLNLLTEEVRLY 
SCTPRNFSVS IREELKRTDT I FWPGCLLVKRCGGNCACCLHNCNECQCVPSKVTKKYHEVLQ 
LRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 
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FIGURE 208 

CCCATCTCAAGCTGATCTTGGCACCTCTCATGCTCTGCTCTCTTCAACCAGACCTCTACATT 




TCCTTTTTAACATAATCCTAATTTCCAAACTCCTTGGGGCTAGATGGTTTCCTAAAACTCTG 

CCCTGTGATGTCACTCTGGATGTTCCAAAGAACCATGTGATCGTGGACTGCACAGACAAGCA 
T T T G AC AGAAAT T C C T G GAG G TAT T C C C AC GAAC AC CAC GAAC C T C AC C C T C AC CAT TAAC C 

ACATACCAGACATCTCCCCAGCGTCCTTTCACAGACTGGACCATCTGGTAGAGATCGATTTC 
AG AT G C AAC TGTGTACCTATTC CAC T G GGG T CAAAAAAC AAC AT G T G CAT CAAG AGG C T G CA 

GATTAAACCCAGAAGCTTTAGTGGACTCACTTATTTAAAATCCCTTTACCTGGATGGAAACC 
AGCTACTAGAGATACCGCAGGGCCTCCCGCCTAGCTTACAGCTTCTCAGCCTTGAGGCCAAC 
AAC AT CTTTTCCAT C AG AAAAG AGAAT C T AAC AGAAC T G G C C AACAT AGAAATAC T C T AC C T 

GGGCCAAAACTGTTATTATCGAAATCCTTGTTATGTTTCATATTCAATAGAGAAAGATGCCT 

TCCTAAACTTGACAAAGTTAAAAGTGCTCTCCCTGAAAGATAACAATGTCACAGCCGTCCCT 
ACTGTTTTGCCATCTACTT T AACAGAAC TAT AT C T C T AC AAC AACAT GAT T GCAAAAAT C CA 

AGAAGATGATTTTAATAACCTCAACCAATTACAAATTCTTGACCTAAGTGGAAATTGCCCTC 

GTTGTTATAATGCCCCATTTCCTTGTGCGCCGTGTAAAAATAATTCTCCCCTACAGATCCCT 

GTAAATGCTTTTGATGCGCTGACAGAATTAAAAGTTTTACGTCTACACAGTAACTCTCTTCA 

GCATGTGCCCCCAAGATGGTTTAAGAACATCAACAAACTCCAGGAACTGGATCTGTCCCAAA 

ACTTCTTGGCCAAA.GAAATTGGGGATGCTAAATTTCTGCATTTTCTCCCCAGCCTCATCCAA 

TTGGATCTGTCTTTCAATTTTGAACTTCAGGTCTATCGTGCATCTATGAATCTATCACAAGC 

ATTTTCTTCACTGAAAAGCCTGAAAATTCTGCGGATCAGAGGATATGTCTTTAAAGAGTTGA 

AAAGCTTTAACCTCTCGCCATTACATAATCTTCAAAATCTTGAAGTTCTTGATCTTGGCACT 

AACTTTATAAAAATTGCTAACCTCAGCATGTTTAAACAATTTAAAAGACTGAAAGTCATAGA 

TCTTTCAGTGAATAAAATATCACCTTCAGGAGATTCAAGTGAAGTTGGCTTCTGCTCAAATG 
C C AGAAC T T C T G T AG AAAG T TAT GAAC C C C AGG T C C T G GAAC AAT TAC AT TAT T T CAGAT AT 

GATAAGTATGCAAGGAGTTGCAGATTCAAAAACAAAGAGGCTTCTTTCATGTCTGTTAATGA 
AAGCTGCTACAAGTATGGGCAGACCTTGGATCTAAGTAAAAATAGTATATTTTTTGTCAAGT 
CCTCTGATTTTCAGCATCTTTCTTTCCTCAAATGCCTGAATCTGTCAGGAAATCTCATTAGC 
CAAACTCTTAATGGCAGTGAATTCCAACCTTTAGCAGAGCTGAGATATTTGGACTTCTCCAA 

CAACCGGCTTGATTTACTCCATTCAACAGCATTTGAAGAGCTTCACAAACTGGAAGTTCTGG 
AT AT AAG C AG T AAT AG C CAT TAT T T TCAAT C AGAAG GAAT TAC T CAT AT G C T AAAC T T TAC C 
AAGAAC C T AAAG G T T C T G C AGAAAC T GAT GAT GAAC GAC AAT GACAT CTCTTCCTCCAC C AG 

CAGGACCATGGAGAGTGAGTCTCTTAGAACTCTGGAATTCAGAGGAAATCACTTAGATGTTT 
TAT G GAG AG AAG G T GAT AAC AG AT AC T T ACAAT TAT T C AAGAAT C T G C T AAAAT T AGAGG AA 

TTAGACATCTCTAAAAATTCCCTAAGTTTCTTGCCTTCTGGAGTTTTTGATGGTATGCCTCC 

AAATC TAAAGAATC T CT C T T TGGCCAAAAATGGGCTCAAATCTTTCAGTTGGAAGAAACTCC 

AGTGTCTAAAGAACCTGGAAACTTTGGACCTCAGCCACAACCAACTGACCACTGTCCCTGAG 
AG AT TAT C C AAC T G T T C C AGAAG CC T CAAGAAT C TGAT T C T TAAGAATAAT C AAATC AGGAG 

TCTGACGAAGTATTTTCTACAAGATGCCTTCCAGTTGCGATATCTGGATCTCAGCTCAAATA 

AAATCCAGATGATCCAAAAGACCAGCTTCCCAGAAAATGTCCTCAACAATCTGAAGATGTTG 

CTTTTGCATCATAATCGGTTTCTGTGCACCTGTGATGCTGTGTGGTTTGTCTGGTGGGTTAA 

CCATACGGAGGTGACTATTCCTTACCTGGCCACAGATGTGACTTGTGTGGGGCCAGGAGCAC 

ACAAGGGCCAAAGTGTGATCTCCCTGGATCTGTACACCTGTGAGTTAGATCTGACTAACCTG 

ATTCTGTTCTCACTTTCCATATCTGTATCTCTCTTTCTCATGGTGATGATGACAGCAAGTCA 

CCTCTATTTCTGGGATGTGTGGTATATTTACCATTTCTGTAAGGCCAAGATAAAGGGGTATC 

AGCGTCTAATATCACCAGACTGTTGCTATGATGCTTTTATTGTGTATGACACTAAAGACCCA 

GCTGTGACCGAGTGGGTTTTGGCTGAGCTGGTGGCCAAACTGGAAGACCCAAGAGAGAAACA 

TTTTAATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTCTGGAAAACCTTT 

CCCAGAGCATACAGCTTAGCAAAAAGACAGTGTTTGTGATGACAGACAAGTATGCAAAGACT 
G AAAAT T T T AAG AT AG C AT T T TAC TTGTCCCAT C AG AG G C T CAT G GAT G AAAAAG T T GAT G T 

GATTATCTTGATATTTCTTGAGAAGCCCTTTCAGAAGTCCAAGTTCCTCCAGCTCCGGAAAA 
GGCTCTGTGGGAGTTCTGTCCTTGAGTGGCCAACAAACCCGCAAGCTCACCCATACTTCTGG 
CAGTGTCTAAAGAACGCCCTGGCCACAGACAATCATGTGGCCTATAGTCAGGTGTTCAAGGA 
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FIGURE 209 



MVFPMWTLKRQILILFNIILISKLLGARWFPKTLPCDVTLDVPKNHVIVDCTDKHLTEIPGG 
IPTNTTNLTLTINHIPDISPASFHRLDHLVEIDFRCNCVPIPLGSKNNMCIKRLQIKPRSFS 
GLTYLKSLYLDGNQLLEIPQGLPPSLQLLSLEANNIFSIRKENLTELANIEILYLGQNCYYR 
NPCYVSYSIEKDAFLNLTKLKVLSLKDNNVTAVPTVLPSTLTELYLYNNMIAKIQEDDFNNL 
NQLQILDLSGNCPRCYNAPFPCAPCKNNSPLQIPVNAFDALTELKVLRLHSNSLQHVPPRWF 
KNINKLQELDLSQNFLAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSL 
KILRIRGYVFKELKSFNLSPLHNLQNLEVLDLGTNFIKIANLSMFKQFKRLKVIDLSVNKIS 
PS GDS S E VG FC SNART S VE S YE PQVLEQLHYFRYDKYARSCRFKNKEAS FMS VNESCYKYGQ 
TLDLSKNSIFFVKSSDFQHLSFLKCLNLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLH 
STAFEELHKLEVLDISSNSHYFQSEGITHMLNFTKNLKVLQKLMMNDNDISSSTSRTMESES 
LRTLEFRGNHLDVLWREGDNRYLQLFKNLLKLEELDISKNSLSFLPSGVFDGMPPNLKNLSL 
AKNGLKSFSWKKLQCLKNLETLDLSHNQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQ 
DAFQLRYLDLS SNKIQMIQKTS FPENVLNNLKMLLLHHNRFLCTCDAVWFVWWVNHTEVTI P 
YLATDVTCVGPGAHKGQSVISLDLYTCELDLTNLILFSLSISVSLFLMVMMTASHLYFWDVW 
YIYHFCKAKIKGYQRLISPDCCYDAFIVYDTKDPAVTEWVLAELVAKLEDPREKHFNLCLEE 
RDWLPGQPVLENLSQSIQLSKKTVFVMTDKYAKTENFKIAFYLSHQRLMDEKVDVIILIFLE 
KPFQKSKFLQLRKRLCGSSVLEWPTNPQAHPYFWQCLKNALATDNHVAYSQVFKETV 
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FIGURE 71 OA 



n r a"» i^a G ^ AGAG £ J G T S E G A J AA ^! J T C AT CAC A C AC AT AAC GAAT GAAT CAT T T C AAGG 

;aacgga2 
:aaaaaac 

:CAGAGTC 

:atttca£ 

'TTGCGAG 
'CAC TAT C 
'TTTCTGA 
'AACATTA 
ICTTGTGA 



AAA. 
CAG 
ATT 
TTG 1 
CAC 
TTTi 
AAG 



^^SSSr^J^IE^^^^TATCACAGACGGGGCATTScTCAfiCCTAAAAAAC 

TGGTTTGCCAGAGTC 
AAGAGGGCATTTCAA 
AACAAAGTTTGCGAG 
GTTGCTATCACTATC 
GCAAACTTTTTCTGA 



;^?^?»J*^S^I?^?^?^ GTTA ^CCAAATACCCTCTGGT?TGCCAGAGTC 

AAAGAGG 

TAACAAA 

GCAACAGCCAGA; C AAa¥aC~a™ 



aI^S?^SJS gatcttg ^ ttc ^ ctatttagt gggagaaatagtctctggggcattttt 

AACGATGCTGCCCCGCTTAGAAATACTTGACTTGTCTTTT^ 



An Ann TTi-ri: ^ o^^J ^ G * AGAAA ? J ^ ? J ? J^^-C TTTTGTCTC tac gggcattgcat t ta 

AAA 
AAA 
GAT 
AAC 



I 



^ GAC ^ C IE G ™ T ? CTAGT GCTCTTACTGAATTGTCCGACTTGGAAGTTC^^ 

A 
A 



a a am ^ £ T AT AAT T CAC AC T AT T T C AGAAT AG CAG G C G T AAC ACAT CAT C TAGAAT T T AT T C A 
^ ^ CACAAAT C T AAAAG TTT TAAAC T T GAGC CACAACAACAT T TATA^T TTAACAG A TA 



CTC 



^2^ CCC ??I GGC I?? CTTTCTG AAGTCAGTAGTCTGAAGCACCTCGATTTAAGTTCCAA^ 

TATGTT 
GGATGG 
GGGGAT 



lTGG 
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FIGURE 21 OB 

CTGGGTCCTCTGCTCAGGGTGTCTCAGAGGCTGCAATGTAGGTGTTCACCAGAGACATAGGC 
ATCACTGGGGTCACACTCATGTGGTTGTTTTCTGGATTCAATTCCTCCTGGGCTATTGGCCA 
AAGGCTATACTCATGTAAGCCATGCGAGCCTCTCCCACAAGGCAGCTTGCTTCATCAGAGCT 
AG C AAAAAAG AG AG G T T G C T AG C AAG AT GAAG T C AC AAT C T T T T G TAAT C GAAT C AAAAAAG 
TGATATCTCATCACTTTGGC CAT AT TCTATTTGT TAG AAG TAAAC CAC AGG T C C CAC C AG C T 
CCATGGGAGTGACCACCTCAGTCCAGGGAAAACAGCTGAAGACCAAGATGGTGAGCTCTGAT 
TGCTTCAGTTGGTCATCAACTATTTTCCCTTGACTGCTGTCCTGGGATGGCCTGCTATCTTG 
ATGATAGATTGTGAATATCAGGAGGCAGGGATCACTGTGGACCATCTTAGCAGTTGACCTAA 
C ACATC T T C T T T T C AA.TAT C TAAGAACTT T TGCCAC TGTGAC TAATGG T CCTAATATTAAGC 
TGTTGTTTATATTTATCATATATCTATGGCTACATGGTTATATTATGCTGTGGTTGCGTTCG 
G T T T T AT T T AC AG T T G C T T T T AC AAAT AT T T G C T G T AAC AT T T GAC T T C T AAG G T T T AGATG 
C C AT T T AAG AA.C T G AG AT G G AT AG C T T T TAAAG C AT C T T T T AC T T C T T AC C AT T T T T T AAAA 
GTATGCAGCTAAATTCGAAGCTTTTGGTCTATATTGTTAATTGCCATTGCTGTAAATCTTAA 
AAT GAAT GAAT AAAAAT G T T T CAT T T T AC AAAAAAAAAAAAAAAA 
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E1QUBEJZ11 



MENMFLQSSMLTCIFLLISGSCELCAEENFSRSYPCDEKKQNDSVIAZCSNRRLQEVPQTVG 

KYVTELDLSDNFITHITNESFQGLQNLTKINLNHNPNVQHQNGNPGIQSNGLNITDGAFLNL 

KNLRELLLEDNQLPQIPSGLPESLTELSLIQNNIYNITKEGISRLINLKNLYLAWNCYFNKV 

CEKTNIEDGVFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINL 

TLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNM 

PHLKVLDLEFNYLVGEIVSGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRAL 

HLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLV 
KD TRQS YANS S S FQRH I RKRRS TDFE FDPHSNFYHFTRPL I KPQCAAYGKALDLS LNS I FFI 

GPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEV 

LDLSYNSHYFRIAGVTHHLEFIQNFTNLKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRL 

DILWNDDDNRYISIFKGLKNLTRLDLSLNRLKHIPNEAFLNLPASLTELHINDNMLKFFNWT 

LLQQFPRLELLDLRGNKLLFLTDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSLKHLDLS 

SNLLKTINKSALETKTTTKLSMLELHGNPFECTCDIGDFRRWMDEHLNVKIPRLVDVICASP 

GDQRGKSIVSLELTTCVSDVTAVILFFFTFFITTMVMIAAI^HLFYWDWFIYWCIAKVK 

GYRSLSTSQTFYDAYISYDTKDASVTDWVINELRYHLEESRDKNVLLCLEERDWDPGLAIID 

NLMQSINQSKKTVFVLTKKYAKSWNFKTAFYLALQRLMDENMDVIIFILLEPVLQHSQYLRL 
RQRICKSSILQWPDNPKAEGLFWQTLRNWLTENDSRYNNMYVDSIKQY 
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FIGURE 212 



CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 

CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA . 

CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 

AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 

TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGA 

GCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAA 

GGCCACCCCGCCTGGAGGCACAGGCC MS AGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT 

CTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGT 

CCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCA 

CCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGC 

CGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGAC 

CAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGA 

GCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCA 

GATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGCCGG 

CAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGC 

CCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAA 

GAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGC 

CCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCC 

TGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTG 

GAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCG3S^CTGCCCAGCGCCCCAGGCTG 

GACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAG 

AAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCC 

TTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCAC 

CCCTGGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTG 

AGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAG 

GCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 

CGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGTTACAAAT 
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FIGURF ?14 



CGCTACGCCCTGCGGTGTCCCGAGGGCTGAGGTCTCCICATCTTCTCCCTAr.r 
GAGCAACCCAACGGGGGCCCGGGGAGGGGAACTGGCCCCGAGGGAGAGGAACCCC 



:tcctgcccctgtccgggggatgactgattc 

GGCCACCCCGCCTGGAGGCACAGGCCAISAGGGGC 

TGGGCGGCACAGAGCACGCCTA 



AGTGGAT 

AAAGCCACATCTGTAGCCAGGATGAGCAGTGTGAATCCAGGCAGCrrrrarr-n 
CACAGGTGGCCCCCACCACCCGGAGGAGCAGC ^^^^ 

TCCTCCGCCAGGCCACCCAGAGGAGAAi 
TCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGCAG 

^^^^^^^^^^ 
CGCG^^^^^^^^^^^^^^^^^^^^^^^ ^^^^^^^^^GGC^A 

^^^^^^^^^^ 

T g TCT gcagagggJ™^^ 



ACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGt^GMGCTGGAGTCCMGGTGG^CTGC^ 

ggaggagaagcxgcagc^gcxgggcgcagxg^^ 



T 



ATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCC 



TTCCAGCAGCTCGGCCGCATCGAC 



tccctgagcgaggagatttccxtcctggaggagcaggtggggtgctggtcgtgcaagaaa. 




ga 

CGCCTGrrr » , r* „ TGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATG 
AG C " 

gggctgggatcttctgt™^ ICCTCGGGAGGCTCC ^agaccctg G gat G ggat 

GATCTTCTCTGTGAATCGACCCGTGGCTACCCCCAGCCTGGCTACCGCAACGGCA 



tcgcaaggcgaggtggaccctcaggtgagggaaggtaggagctccctggtggagcctgggac 

CCCCAGCACAATAAAAATGAAACGTG^^^^^^^^^^^^^^^^^^^ GG ^ GG '^ GGG ^' G ^ G 



CCATGGCACAGGCCAGGCA 
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FIGURE 215 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 




BNSDOCID: <WO 9946281A2_IA> 



* 




PCT/US99/05028 



WO 99/46281 



FIGURE 216 



CCCACGCGTCCGAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGACAGGCCAGGCA 

GGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAGGG 

TCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCCAGCAGCAT 

CAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGC 

CCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGC 

CTGGAGGCACAGGCC^EfiAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGC 

AGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACG 

GGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGAC 

GGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGG 

GCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTC 

CTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAG 

CCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGA 

ATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCGTCAACACCGCCGGCAGTTACTGGT 

GCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGG 

CCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAG 

GCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACA 

GCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCC 

TTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCT 

GGGGTCCTGCTCCTGCAAGAAAGACTCG2S&CTGCCCAGCGCCCCAGGCTGGACTGAGCCCC 
TCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCG 
GGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAG 
GCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACC 
CCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTGAGGGAAGGTAC 
GAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGG 

CCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTG 
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FIGURE 218 



GGTTGCCACAGCTGGTTTAGGGCCCCGACCACTGGGGCCCCTTGTCAGGAGGAGACAGCCTC 
CCGGCCCGGGGAGGACAAGTCGCTGCCACCTTTGGCTGCCGACGTGATTCCCTGGGACGGTC 
CGTTTCCTGCCGTCAGCTGCCGGCCGAGTTGGGTCTCCGTGTTTCAGGCCGGCTCCCCCTTC 
CTGGTCTCCCTTCTCCCGCTGGGCCGGTTTATCGGGAGGAGATTGTCTTCCAGGGCTAGCAA 
TTGGACTTTTGATGATGTTTGACCCAGCGGCAGGAATAGCAGGCAACGTGATTTCAAAGCTG 
GGCTCAGCCTCTGTTTCTTCTCTCGTGTAATCGCAAAACCCATTTTGGAGCAGGAATTCCAA 

T C A2£T CTGTGATGGTGGT G AGAAAGAAGG T GAC AC G GAAAT GG GAGAAAC T C C C AG GCAG G 
AACACCTTTTGCTGTGATGGCCGCGTCATGATGGCCCGGCAAAAGGGCATTTTCTACCTGAC 
CCTTTTCCTCATCCTGGGGACATGTACACTCTTCTTCGCCTTTGAGTGCCGCTACCTGGCTG 
TTCAGCTGTCTCCTGCCATCCCTGTATTTGCTGCCATGCTCTTCCTTTTCTCCATGGCTACA 
CTGTTGAGGACCAGCTTCAGTGACCCTGGAGTGATTCCTCGGGCGCTACCAGATGAAGCAGC 
TTTCATAGAAATGGAGATAGAAGCTACCAATGGTGCGGTGCCCCAGGGCCAGCGACCACCGC 
CTCGTATCAAGAATTTCCAGATAAACAACCAGATTGTGAAACTGAAATACTGTTACACATGC 
AAGATCTTCCGGCCTCCCCGGGCCTCCCATTGCAGCATCTGTGACAACTGTGTGGAGCGCTT 
CGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTACCGCTACTTCTACC 
TCTTCATCCTTTCTCTCTCCCTCCTCACAATCTATGTCTTCGCCTTCAACATCGTCTATGTG 
GCCCTCAAATCTTTGAAAATTGGCTTCTTGGAGACATTGAAAGAAACTCCTGGAACTGTTCT 
AGAAGTCCTCATTTGCTTCTTTACACTCTGGTCCGTCGTGGGACTGACTGGATTTCATACTT 

TCCTCGTGGCTCT C AAC C AG AC AAC CAAT GAAGACAT CAAAGGAT CAT GGACAGGGAAGAAT 

CGCGTCCAGAATCCCTACAGCCATGGCAATATTGTGAAGAACTGCTGTGAAGTGCTGTGTGG 

CCCCTTGCCCCCCAGTGTGCTGGATCGAAGGGGTATTTTGCCACTGGAGGAAAGTGGAAGTC 

GACCTCCCAGTACTCAAGAGACCAGTAGCAGCCTCTTGCCACAGAGCCCAGCCCCCACAGAA 

CACCTGAACTCAAATGAGATGCCGGAGGACAGCAGCACTCCCGAAGAGATGCCACCTCCAGA 

GCCCCCAGAGCCACCACAGGAGGCAGCTGAAGCTGAGAAG2&SCCTATCTATGGAAGAGACT 

T T TGT TTG TGTT TAATTAGGGCTATGAGAGATTTCAGGTGAGAAGTTAAACCTGAGACAGAG 

AGCAAGTAAGCTGTCCCTTTTAACTGTTTTTCTTTGGTCTTTAGTCACCCAGTTGCACACTG 

GCATTTTCTTGCTGCAAGCTTTTTTAAATTTCTGAACTCAAGGCAGTGGCAGAAGATGTCAG 

TCACCTCTGATAACTGGAAAAATGGGTCTCTTGGGCCCTGGCACTGGTTCTCCATGGCCTCA 

GCCACAGGGTCCCCTTGGACCCCCTCTCTTCCCTCCAGATCCCAGCCCTCCTGCTTGGGGTC 

ACTGGTCTCATTCTGGGGCTAAAAGTTTTTGAGACTGGCTCAAATCCTCCCAAGCTGCTGCA 

CGTGCTGAGTCCAGAGGCAGTCACAGAGACCTCTGGCCAGGGGATCCTAACTGGGTTCTTGG 

GGTCTTCAGGACTGAAGAGGAGGGAGAGTGGGGTCAGAAGATTCTCCTGGCCACCAAGTGCC 

AGCATTGCCCACAAATCCTTTTAGGAATGGGACAGGTACCTTCCACTTGTTGTANNNNNNNN 

NNNNNNNNNNNNNNNNNNNTTGTTTTTCCTTTTGACTCCTGCTCCCATTAGGAGCAGGAATG 

GCAGTAATAAAAGTCTGCACTTTGGTCATTTCTTTTCCTCAGAGGAAGCCCGAGTGCTCACT 

TAAACACTATCCCCTCAGACTCCCTGTGTGAGGCCTGCAGAGGCCCTGAATGCACAAATGGG 

AAACCAAGGCACAGAGAGGCTCTCCTCTCCTCTCCTCTCCCCCGATGTACCCTCAAAAAAAA 

AAAAATGCTAACCAGTTCTTCCATTAAGCCTCGGCTGAGTGAGGGAAAGCCCAGCACTGCTG 

CCCTCTCGGGTAACTCACCCTAAGGCCTCGGCCCACCTCTGGCTATGGTAACCACACTGGGG 

GCTTCCTCCAAGCCCCGCTCTTCCAGCACTTCCACCGGCAGAGTCCCAGAGCCACTTCACCC 

TGGGGGTGGGCTGTGGCCCCCAGTCAGCTCTGCTCAGGACCTGCTCTATTTCAGGGAAGAAG 

ATTTATGTATTATATGTGGCTATATTTCCTAGAGCACCTGTGTTTTCCTCTTTCTAAGCCAG 

GGTCCTGTCTGGATGACTTATGCGGTGGGGGAGTGTAAACCGGAACTTTTCATCTATTTGAA 

GGCGATTAAACTGTGTCTAATGCA 
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FIGURE 22Q 



AAAAC C C T G TAT T T T T T AC AAT GC AAATAG AC AATNANC C T GGAGG T C T T TGAAT TAG G TAT 
TATAGGGATGGTGGGGTTGATTTTTNTTCCTGGAGGCTTTTGGCTTTGGACTCTCNCTTTCT 
CCCACAGAGCNCTTCGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTA 

CCGCTANTTCTACCTCTTCATCCTTTNTCTCTCCCNCCTCACAATCTATGTCTTCGCCTTCA 
ACATCGT 
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FIGURE 271 

GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACACAA 
TCTATCAGGAAAGAAAGAAAGAAAAAAACCGAACCTGACAAAAAAGAAGAAAAAGAAGAAGA 

TTGGGCAATCTTCAC 

GGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCC 
CCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATT 
GACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGA 
CAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCG 
AGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAAC 
CACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTC 
T T CAGATAT CTCCAT TAAT GAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAGAC 

CAGAGCCTACGGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGAC 
GAATACTTGGAAATTCAGGGCATCACCCGGGAGCAGTCAGGGGACTACGAGTGCAGTGCCTC 
CAATGACGTGGCCGCGCCCGTGGTACGGAGAGTAAAGGTCACCGTGAACTATCCACCATACA 
TTTCAGAAGCCAAGGGTACAGGTGTCCCCGTGGGACAAAAGGGGACACTGCAGTGTGAAGCC 
TC AGC AG T C CC C T CAGCAGAAT TCCAGT GGTACAAGGATGACAAAA.GAC T GAT TGAAGGAAA 

GAAAGGGGTGAAAGTGGAAAACAGACCTTTCCTCTCAAAACTCATCTTCTTCAATGTCTCTG 
AACATGACTATGGGAACTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGC 
ATCATGCTATTTGGTCCAGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGG 




'GTGAGTGCC 

ACTTCCCCACCCGGGAAAGGCTGCCGCCACCACCACCACCAACACAACAGCAATGGCAACAC 
CGACAGCAACCAATCAGATATATACAAATGAAA.TTAGAAGAAACACAGCCTCATGGGACAGA 
AAT T T GAGGGAGGGGAACAAAGAATACT T TGGGGGGAAAAGAGT T TTAAAAAAGAAAT TGAA 

AATTGCCTTGCAGATATTTAGGTACAATGGAGTTTTCTTTTCCCAAACGGGAAGAACACAGC 

ACACCCGGCTTGGACCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAA 

GGGCTCAGCCTCTCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCA 

AATTCAATCAGTCCATAGAGACGAACAGAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGG 

GCACTTTGGTAGACTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAA 
AAAAA 
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FIGURE 222 



MKTIQPKMHNS ISWAIFTGLAALCLFQGVPVRSGDATFPKAMDNVTVRQGESATLRCTIDNR 
VTRVAWLNRSTILYAGNDKWCLDPRWLLSNTQTQYSIEIQNVDVYDEGPYTCSVQTDNHPK 
TSRVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYL 
EIQGITREQSGDYECSASNDVAAPVVRRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAV 
PSAEFQWYKDDKRLIEGKKGVKVENRPFLSKLIFFNVSEHDYGNYTCVASNKLGHTNAS IML 
FGPGAVSEVSNGTSRRAGCVWLLPLLVLHLLLKF 




BNSDOCID: <WO 9946281 A2_IA> 



WO 99/46281 




* 



PCT/US99/05028 




GAAAAAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTC 
TTGA^CCOGg' ^ GTGACGGTCCGGCACGGG5 ^^«ACCCTC fl GGTGCACTA 

gIca^gg TG cc^ C GGTGGCCTGGCTAAACCG ^^catcctctatgctgggaat 

GACAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCAT 

cgagatccagaacgtggatgtgtatgacgagggcccttacacctgctcggtg^gaSga^ 

accacccaaagacctctagggtccacctcattgtgcaagtatctcccaaaattgta^ga" 
tcttcagaxaxctccaxtaatgaagggaacaa^xagcctcagc tgcaxag^™ 



■ <WO 9946281 A2_lA> 





WO 99/46281 ^PF PCT/US99/05028 

FIGURE 2?A 

ATGGCTGGTGACGGCGGGGCCGGGCAGGGGACCGGGGCCGCGGCCCGGGAGCGGGCCAGCTG 

CCGGGAGCCCTGAATCACCGCCTGGCCCGACTCCACC^TGAACGTCGCGCTGCAGGAGCTGG 

GAGCTGGCAGCAACGTGGGATTCCAGAAGGGGACAAGACAGCTGTTAGGCTCACGCACGCAG 

CTGGAGCTGGTCTTAGCAGGTGCCTCTCTACTGCTGGCTGCACTGCTTCTGGGCTGCCTTGT 

GGCCCTAGGGGTCCAGTACCACAGAGACCCATCCCACAGCACCTGCCTTACAGAGGCCTGCA 

TTCGAGTGGCTGGAAAAATCCTGGAGTCCCTGGACCGAGGGGTGAGCCCCTGTGAGGACTTT 

TACCAGTTCTCCTGTGGGGGCTGGATTCGGAGGAACCCCCTGCCCGATGGGCGTTCTCGCTG 
GAAC AC C T T C AAC AG C C T C T G G G AC C AAAAC C AGG C CAT AC T G AAG C AC C T G C T T GAAAAC A 

CCACCTTCAACTCCAGCAGTGAAGCTGAGCAGAAGACACAGCGCTTCTACCTATCTTGCCTA 

CAGGTGGAGCGCATTGAGGAGCTGGGAGCCCAGCCACTGAGAGACCTCATTGAGAAGATTGG 

TGGTTGGAACATTACGGGGCCCTGGGACCAGGACAACTTTATGGAGGTGTTGAAGGCAGTAG 

CAGGGACCTACAGGGCCACCCCATTCTTCACCGTCTACATCAGTGCCGACTCTAAGAGTTCC 

AACAGCAATGTTATCCAGGTGGACCAGTCTGGGCTCTTTCTGCCCTCTCGGGATTACTACTT 

AAACAGAACTGCCAATGAGAAAGTGCTCACTGCCTATCTGGATTACATGGAGGAACTGGGGA 

TGCTGCTGGGTGGGCGGCCCACCTCCACGAGGGAGCAGATGCAGCAGGTGCTGGAGTTGGAG 

ATACAGCTGGCCAACATCACAGTGCCCCAGGACCAGCGGCGCGACGAGGAGAAGATCTACCA 

CAAGATGAGCATTTCGGAGCTGCAGGCTCTGGCGCCCTCCATGGACTGGCTTGAGTTCCTGT 

CTTTCTTGCTGTCACCATTGGAGTTGAGTGACTCTGAGCCTGTGGTGGTGTATGGGATGGAT 
TAT T T G C AG C AG GT G T C AGAG C T CAT C AAC C GC AC GGAAC CAAGCAT C C T GAACAAT TACC T 

GATCTGGAACCTGGTGCAAAAGACAACCTCAAGCCTGGACCGACGCTTTGAGTCTGCACAAG 
AGAAGCTGCTGGAGACCCTCTATGGCACTAAGAAGTCCTGTGTGCCGAGGTGGCAGACCTGC 

ATCTCCAACACGGATGACGCCCTTGGCTTTGCTTTGGGGTCACTCTTCGTGAAGGCCACGTT 
T GAC C G G C AAAG CAAAGAAAT T G C AGAGGGGAT GAT CAGC GAAATC C G GAC C GC AT T T GAGG 

AGGCCCTGGGACAGCTGGTTTGGATGGATGAGAAGACCCGCCAGGCAGCCAAGGAGAAAGCA 
^2S CATCTATGATATGATTGGTTTCCCAGACTTTAT CCTGGAGCCCAAAGAGCTGGATGA 
TGTTTATGACGGGTACGAAATTTCTGAAGATTCTTTCTTCCAAAACATGTTGAATTTGTACA 
ACTTCTCTGCCAAGGTTATGGCTGACCAGCTCCGCAAGCCTCCCAGCCGAGACCAGTGGAGC 
ATGACCCCCCAGACAGTGAATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGC 
TGGCATCCTGCAGGCCCCCTTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCA 
TCGGTGTGGTCATGGGCCATGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGAC 
AAAGAAGGGAACCTGCGGCCCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACAC 
GGCCTGCATGGAGGAACAGTACAATCAATACCAGGTCAATGGGGAGAGGCTCAACGGCCGCC 

AGA S G "S GGGGAGAACATm ^ 

GCATGGCTGAGAAAGCATGGGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCACCA 
G " G "E"E GTGGGATTTGCCCAGGTGTGGT GCTCGGTCCGCACACCAGAGAGCTCTCACG 
AGGGGCTGGTGACCGACCCCCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAAC 
TCCCGTGACTTCCTGCGGCACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCT 




GCTCTCCTGACAAAGCTGTTTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGT 
G I AG ^ GG ?I CCCCCCCACAGGTGACAT GAGTACAGACCCTCCTCAATCACCACATTG 
SJ GC J TTGGGGGTGCCCCTGCCTCCAGCAGAGC CCCCACCATTCACTGTGACATCTTTCCGT 
GTCACCCTGCCTGGAAGAGGTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCC 
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FIGURE ?7K 

MNVALQELGAGSNVGFQKGTRQLLGSRTQLELVLAGASLLLAALLLGCLVALGVQYHRDPSH 

STCLTEACIRVAGKILESLDRGVSPCEDFYQFSCGGWIRRNPLPDGRSRWNTFNSLWDQNQA 

ILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQPLRDLIEKIGGWNITGPWDQDN 

FMEVLKAVAGTYRATPFFTVYISADSKSSNSNVIQVDQSGLFLPSRDYYLNRTANEKVLTAY 
LDYMEELGMLLGGRPTS TRE QMQQVLELE I QLANI T VP QDQRRDEE K I YHKMS I SELQALAP 

SMDWLEFLSFLLSPLELSDSEPVWYGMDYLQQVSELINRTEPSILNNYLIWNLVQKTTSSL 
DRRFESAQEKLLETLYGTKKSCVPRWQTCISNTDDALGFALGSLFVKATFDRQSKEIAEGMI 
SEIRTAFEEALGQLVWMDEKTRQAAKEKApAIYDMIGFPDFILEPKELDDyYDGYEISEDSF 
FQNMLNLYNFSAKVMADQLRKPPSRDQWSMTPQTVNAYYLPTKNEIVFPAGILQAPFYARNH 
PKALNFGG I GWMGHELTHAFDDQGRE YDKEGNLRPWWQNESIAAFRNHTACMEEQYNQYQV 

NGERLNGRQTLGENITDNGGLKAAYNAYKAWLRKHGEEQQLPAVGLTNHQLFFVGFAQVWCS 
VRTPESSHEGLVTDPHSPARFRVLGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW 
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FIGURE 79fiA 



GCCCGGCCCTCCGCCCTCCGCACTCCCGCCT 
CCTCCCTCC 



a ^v^c i <-<~^C-ACTCCCGCCTCCCTCCCTCCGCCCGCTCCCGCGCrrTrrTr 

CTGCTGCTCCTCGGGCTGCTGCTGCTCGGCTCCCGGCCGGCCCGCGGCGrrR'"- rrarnrrr 
CCCCGTGCTGCCCATCCGTTCTGAGAAGGAGCCGCT^CCGTTC^ 

gggcgcccgggggaggcgcgggcggggagtcgggctcggggcgagtcagcgccaS^ 
ggcgcggtgcctgggacccgggacccgcgggcagcccccggggcggcacacggcgcgagctg 



c gcctgcgaggcgcagtggggtcgccgtaccaggggccctggcaggg?cagc^ 



TCAAACCAGAGTGCCCAACCCCGGCCTGTG! 



IGCAGCCGCGCCAGCTGCCGGGACACTGCTGC 



a^gIg^c^ 

GCCCTACCAGG 
C C C AC C C AAG A 
CCTTAGGGCAG 
GGGGGCCTCTG 
G AAG GCCCC C A 

ctccttgcattttttgctgctcttccgaggccttgcaggactaacccaggt 



GCCCTACrz,r.r. B SS^S^SSSI"^SE????? TTCTCTATCTC CTACAGGCGGCTGGACC 

CCCACCC. 
CCTTAGG' 

GAAGGCCCCCACCAGCAGGGCGTAGGGGGCATCACCCTGCTCACTCTCA 



CAGGATCCGCTTCTCAGACTCCAATGGCAGTGTCCTGTTTGAGCACCCTGCArrr 

-^^^^GATGGCCTGGTCTGTGGGGTGTGGCGGGCAGTGCCTCGGTTGTrTrTr^nS 
^T^GGCAGAACAGCTG^^ 

GGGGGCCTCTCATCCGGCACCGGGCCCTGTCCCCAGAGACCTTCAG' 



_ _ GGTGCTGCCCAACCTGACAGTCCAGGAGATGGACTGGCTGGTGCTGGG 



G 
Ci 
A< 

c< 

C" 

c* 

c< 

G( 

TCCTTGGGCCTCCTGGAACGCCAGGGCCTCGGCGGCTG^ 



ctggaaaccaagcctc 
ctcccctgcccccagg 
tgctgcagaatgagct 
gggcaacgtggctgcc 

ccccctgtgaagagccaaVga^ 

gcactatgaagtgctgctggctgggcttggtggctcagaacaIggcac?g^ 



C S^S^^E?5I AGGGACAA CCAGTGAGGTGGTGGCCATGACACTGGAAACCAAGCCTC 

TCCCCTG 

GCTGCAG. 

rTr _------"^ ~ V*~ w x ^^^^^^^utJTTCGGGGGCAACGTGGCTGCC 

G J GGGG J AG J GTGGGGCATA GCGCCCGCCCTGCCCGTGCCCCTAGCAGGAGCCC 



C " CC I GAACGTGGGCACC ^GGACTTCCCAGACGGA<^ 



£c^gS<£ GCGCCCGCCA ^ 

S?GGTGACCGG^S^? GGCTCCCGflCCAGTG " G ScTG???GC^ 
T?AA?^G?i A |^r? C ? GGCRGCGGG ^ CGCGGTGGCACCCCGTTGT ^^CC^^ 

AACAGTGTCCAGGTGAGGCCCACCCCCAGCTGGGGGACCCCATGCAGGCTGATGGGCCCCGG 



CACTGTCC 
CGTAAGTG. 
CTGATCAGI 
.TTCCAATC< 

aattaagcaacgagatgaaggtcacccagcVgVgVgc^ 



AGG E A 5 GG . G . TGCCTCA CTGTGAGCGGGATGACTGTTCACTGCCACTGTCCTGTGGCTCGGGG 

■CGGCGGCGTA 

GGACTCCTGA 



GTCAGCA^GCTGTGAGT^ 

TCACTrTr5^I^ G ^™S GCTCACCTGCCTGTGGGAC 

I CA SI G IE G ? AGGAACAGTGCA GCCTGCCTCACAAGTGCCATTCCAATCCACCCTCACAGC 



ATCAGGGAAGGGAGCAC 



VO 9946281 A2 J A> 





WO 99/46281 PCT/US99/05028 



FIGURE 79fiR 

CTTTCTGGGACCAA.GGCAGGGATGCTTTGCCCTGCCCTCTATGCCTCTCTGTGCCTCTCCAC 

TCCCTCTCCCCTCCTCCAACATTCCCTCCCTTCTGTCTCCAGCAGCCCCAGAGACCAGAACT 
GAT C C AG AG C T G G AG AAAG AAG C C G AAG GC T C T T AGG G AG C AG C CAG AG G G C C AAG T GACCA 

AGAGGATGGGGCCTGAGCTGGGGAAGGGGTGGCATCGAGGACCTTCTTGCATTCTCCTGTGG 

GAAGCCCAGTGCCTTTGCTCCTCTGTCCTGCCTCTACTCCCACCCCCACTACCTCTGGGAAC 

CACAGCTCCACAAGGGGGAGAGGCAGCTGGGCCAGACCGAGGTCACAGCCACTCCAAGTCCT 

GCCCTGCCACCCTCGGCCTCTGTCCTGGAAGCCCCACCCCTTTCTTCCTGTACATAATGTCA 

CTGGCTTGTTGGGATTTTTAATTTATCTTCACTCAGCACCAAGGGCCCCGGACACT^CA^ 

CTGCTGCCCCTGAGCTGAGCAGAGTCATTATTGGAGAGTTTTGTATTTATTAAAACATTTCT 

TTTTCAGTCTTTGGGCATGAGGTTGGCTCTTTGTGGCCAGGAACCTGAGTGGGGCCTGGTGG 

AGAAGGGGCNGAGAGTAGGAGGTGAGAGAGAGGAGCTCTGACACTTGGGGAGCTGAAAGAGA 

CCTGGAGAGGCAGAGGATAGCGTGGCNNTTGGCTGGCATNCCTGGGTTCCGCAGAGGGGCTG 

GGGATGGTTCTTGAGATGGTCTAGAGACTCAAGAATTTAGGGAAGTAGAAGCAGGATTTTGA 
CTCAAGTTTAGTTTCCCACATCGCTGGCCTGTTTGC^ 

AGAGAGAATCAAAGGTGTCACCAGCCCCTCTCTCCCTCCTTCCCTTCCCTTCCCTTTCTTTr 
CCTCCCCTCCCCTCCCCTCCCCTCCCCTCC ICTTTC 
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FIGURE 227 



GGCCGAGCGGGGGTGCTGCGCGGCGGCCGTGATGGCTGGTGACGGCGGGGCCGGGCAGGGGA 

CCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGAATCACCGCCTGGCCCGAC 

TCCACCATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAACGTGGGATTCCAGAAGGG 

GACAAGACAGCTGTTAGGCTCACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTAC 

TGCTGGCTGCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCA 

TCCCACAGCACCTGCCTTACAGAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCT 

GGACCGAGGGGTGAGCCCCTGTGAGGACTTTTACCAGTTCTCCTGTGGGGGCTGGATTCGGA 

GGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAAC 

CAGGCCATACTGAAGCACCTGCTTGAAAACACCACCTTCAACTCCAGCAGTGAAGCTGAGCA 

GAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCC 

AGCCACTGAGAGACCTCATTGAGAAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAG 

GACAACTTTATGGAGGTGTTGAAGGCAGTAGCAGGGACCTACAGGGCCACCCCATTCTTCAC 

CGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGGACCAGTCTG 

GGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAGAAAGTAAGGAAC 

ATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGCTGATCCCTGTTGACTTTTCCCT 

TTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTGTCACCTAGTGAACAAACTGCCCCT 

CCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTTTCTTCCCCTTTTCCTTCCTTCCTTCC 

TCTTATTCTTCTAGTAGGTTTCATAGACACCTACTGTGTGCCAGGTCCAGTGGGGGAATTCG 

GAGATATAAGTTTCCGAGCCATTGCCACAGGAAGCGTTCAGTGTCGATGGGTTCATGGACCT 

AGATAGGCTGATAACAAAGCTCACAAGAGGGTCCTGAGGATTCAGGAGAGACTTATGGAGCC 
AGCAAAGTCTTCCTGAAGAGATTGCATTTGAGCCAGGTCCTGTAG 
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ATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGCTGGCATCCTGCAGGCCCCC 

TTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATCGGTGTGGTCATGGGCCA 

TGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTGCGGC 

CCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACACGGCCTGCATGGAGGAACAG 

TACAATCAATACCAGGTCAATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGAACAT 

TGCTGACAACGGGGGGCTGAAGGCTGCCTACAATGCTTACAAAGCATGGCTGAGAAAGCATG 

GGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCACCAGCTCTTCTTCGTGGGATTT 

GCCCAGGTGTGGTGCTCGGTGCGCACACCAGAGAGCTCTCACGAGGGGCTGGTGACCGACCC 

CGACAGCCCTGGCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTGCCGTGACTTCCTGCGGG 

ACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACC 

TGGATCAGGGGAGAAATGGGCAGCTGTCACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGT 

TTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACA 

GGTGACATGAGTACAGACCCTCCTCAATCACCACATTGTGCCTCTGCTTTGGGGGTGCCCCT 

GCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCTGCCTGGAAGAG 

GTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCCTCTTCTGICCCCAGGCICACT 

GAGCGTGGCGGCCATGGGGCCTGCCGTGCCTGCCCGACTGTGACCCACAGGCCTGGGTGGTG 

TACCTGCTGGACTTCTCCCCAGGCTCACTCAGTGCGCAGTTAGGGGTGGAGTCAGCTCTGTC 

TGGCTCACCCTGACGGGCTACCCCCACCTCACCCTGTGCTCCTTGTGCCACTGCTCCCAGTG 

CTGCTGCTGACCTTGACTGACAGCTCCTAGTGGAAGCCCAAGGGCCTCTGAAAGCGTCGTGC 

TGCCCACTGTTTCCCTGGGCTGAGAGGGGAAGTGCATATGTGTAGCGGGTACTGGTTCCTGT 

GTGTTAGGGCACAAGCCTTAGCAAAIGATTGATTCTGCGTGGACAAAGCAGGAAAGCAGATA 

GAGCAGGGAAAAGGAAGAACAGAGTTTATTTTTACAGAAAAGAGGGTGGGAGGGTGTGGTCT 
TGGCCCTTATAGGACC 
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